Information retrieval: recent advances and beyond

KA Hambarde, H Proenca - IEEE Access, 2023 - ieeexplore.ieee.org
This paper provides an extensive and thorough overview of the models and techniques
utilized in the first and second stages of the typical information retrieval processing chain …

A proposed conceptual framework for a representational approach to information retrieval

J Lin - ACM SIGIR Forum, 2022 - dl.acm.org
This paper outlines a conceptual framework for understanding recent developments in
information retrieval and natural language processing that attempts to integrate dense and …

A survey for efficient open domain question answering

Q Zhang, S Chen, D Xu, Q Cao, X Chen, T Cohn… - arXiv preprint arXiv …, 2022 - arxiv.org
Open domain question answering (ODQA) is a longstanding task aimed at answering factual
questions from a large knowledge corpus without any explicit evidence in natural language …

Introducing neural bag of whole-words with colberter: Contextualized late interactions using enhanced reduction

S Hofstätter, O Khattab, S Althammer… - Proceedings of the 31st …, 2022 - dl.acm.org
Recent progress in neural information retrieval has demonstrated large gains in quality,
while often sacrificing efficiency and interpretability compared to classical approaches. We …

Are we there yet? A decision framework for replacing term based retrieval with dense retrieval systems

S Hofstätter, N Craswell, B Mitra, H Zamani… - arXiv preprint arXiv …, 2022 - arxiv.org
Recently, several dense retrieval (DR) models have demonstrated competitive performance
to term-based retrieval that are ubiquitous in search systems. In contrast to term-based …

Domain adaptation for memory-efficient dense retrieval

N Thakur, N Reimers, J Lin - arXiv preprint arXiv:2205.11498, 2022 - arxiv.org
Dense retrievers encode documents into fixed dimensional embeddings. However, storing
all the document embeddings within an index produces bulky indexes which are expensive …

Understanding and mitigating the threat of vec2text to dense retrieval systems

S Zhuang, B Koopman, X Chu, G Zuccon - Proceedings of the 2024 …, 2024 - dl.acm.org
The emergence of Vec2Text---a method for text embedding inversion---has raised serious
privacy concerns for dense retrieval systems which use text embeddings, such as those …

Query expansion using contextual clue sampling with language models

L Liu, M Li, J Lin, S Riedel, P Stenetorp - arXiv preprint arXiv:2210.07093, 2022 - arxiv.org
Query expansion is an effective approach for mitigating vocabulary mismatch between
queries and documents in information retrieval. One recent line of research uses language …

[PDF][PDF] Injecting domain adaptation with learning-to-hash for effective and efficient zero-shot dense retrieval

N Thakur, N Reimers, J Lin - arXiv: 2205.11498, 2022 - reneuir.org
Dense retrieval overcome the lexical gap and has shown great success in ad-hoc
information retrieval (IR). Despite their success, dense retrievers are expensive to serve …

An encoder attribution analysis for dense passage retriever in open-domain question answering

M Li, X Ma, J Lin - Proceedings of the 2nd Workshop on …, 2022 - aclanthology.org
The bi-encoder design of dense passage retriever (DPR) is a key factor to its success in
open-domain question answering (QA), yet it is unclear how DPR's question encoder and …