Dense text retrieval based on pretrained language models: A survey
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …
required to return relevant information resources to user's queries in natural language. From …
How to train your dragon: Diverse augmentation towards generalizable dense retrieval
Various techniques have been developed in recent years to improve dense retrieval (DR),
such as unsupervised contrastive learning and pseudo-query generation. Existing DRs …
such as unsupervised contrastive learning and pseudo-query generation. Existing DRs …
PLAID: an efficient engine for late interaction retrieval
Pre-trained language models are increasingly important components across multiple
information retrieval (IR) paradigms. Late interaction, introduced with the ColBERT model …
information retrieval (IR) paradigms. Late interaction, introduced with the ColBERT model …
Low-resource dense retrieval for open-domain question answering: A comprehensive survey
Dense retrieval (DR) approaches based on powerful pre-trained language models (PLMs)
achieved significant advances and have become a key component for modern open-domain …
achieved significant advances and have become a key component for modern open-domain …
Aggretriever: A simple approach to aggregate textual representations for robust dense passage retrieval
Pre-trained language models have been successful in many knowledge-intensive NLP
tasks. However, recent work has shown that models such as BERT are not “structurally …
tasks. However, recent work has shown that models such as BERT are not “structurally …
Scalable and effective generative information retrieval
Recent research has shown that transformer networks can be used as differentiable search
indexes by representing each document as a sequence of document ID tokens. These …
indexes by representing each document as a sequence of document ID tokens. These …
Led: Lexicon-enlightened dense retriever for large-scale retrieval
Retrieval models based on dense representations in semantic space have become an
indispensable branch for first-stage retrieval. These retrievers benefit from surging advances …
indispensable branch for first-stage retrieval. These retrievers benefit from surging advances …
Distillation from heterogeneous models for top-k recommendation
Recent recommender systems have shown remarkable performance by using an ensemble
of heterogeneous models. However, it is exceedingly costly because it requires resources …
of heterogeneous models. However, it is exceedingly costly because it requires resources …
Tevatron: An efficient and flexible toolkit for neural retrieval
Recent rapid advances in deep pre-trained language models and the introduction of large
datasets have powered research in embedding-based neural retrieval. While many …
datasets have powered research in embedding-based neural retrieval. While many …
Listwise generative retrieval models via a sequential learning process
Recently, a novel generative retrieval (GR) paradigm has been proposed, where a single
sequence-to-sequence model is learned to directly generate a list of relevant document …
sequence-to-sequence model is learned to directly generate a list of relevant document …