Query expansion techniques for information retrieval: a survey
With the ever increasing size of the web, relevant information extraction on the Internet with
a query formed by a few keywords has become a big challenge. Query Expansion (QE) …
a query formed by a few keywords has become a big challenge. Query Expansion (QE) …
Semantic models for the first-stage retrieval: A comprehensive review
Multi-stage ranking pipelines have been a practical solution in modern search systems,
where the first-stage retrieval is to return a subset of candidate documents and latter stages …
where the first-stage retrieval is to return a subset of candidate documents and latter stages …
Approximate nearest neighbor negative contrastive learning for dense text retrieval
Conducting text retrieval in a dense learned representation space has many intriguing
advantages over sparse retrieval. Yet the effectiveness of dense retrieval (DR) often requires …
advantages over sparse retrieval. Yet the effectiveness of dense retrieval (DR) often requires …
[图书][B] Pretrained transformers for text ranking: Bert and beyond
The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in
response to a query. Although the most common formulation of text ranking is search …
response to a query. Although the most common formulation of text ranking is search …
Query2doc: Query expansion with large language models
This paper introduces a simple yet effective query expansion approach, denoted as
query2doc, to improve both sparse and dense retrieval systems. The proposed method first …
query2doc, to improve both sparse and dense retrieval systems. The proposed method first …
COIL: Revisit exact lexical match in information retrieval with contextualized inverted list
Classical information retrieval systems such as BM25 rely on exact lexical match and carry
out search efficiently with inverted list index. Recent neural IR models shifts towards soft …
out search efficiently with inverted list index. Recent neural IR models shifts towards soft …
Asking clarifying questions in open-domain information-seeking conversations
Users often fail to formulate their complex information needs in a single query. As a
consequence, they may need to scan multiple result pages or reformulate their queries …
consequence, they may need to scan multiple result pages or reformulate their queries …
A deep look into neural ranking models for information retrieval
Ranking models lie at the heart of research on information retrieval (IR). During the past
decades, different techniques have been proposed for constructing ranking models, from …
decades, different techniques have been proposed for constructing ranking models, from …
PARADE: Passage Representation Aggregation forDocument Reranking
Pre-trained transformer models, such as BERT and T5, have shown to be highly effective at
ad hoc passage and document ranking. Due to the inherent sequence length limits of these …
ad hoc passage and document ranking. Due to the inherent sequence length limits of these …
Anserini: Enabling the use of lucene for information retrieval research
Software toolkits play an essential role in information retrieval research. Most open-source
toolkits developed by academics are designed to facilitate the evaluation of retrieval models …
toolkits developed by academics are designed to facilitate the evaluation of retrieval models …