Information retrieval: recent advances and beyond

KA Hambarde, H Proenca - IEEE Access, 2023 - ieeexplore.ieee.org
This paper provides an extensive and thorough overview of the models and techniques
utilized in the first and second stages of the typical information retrieval processing chain …

Dense text retrieval based on pretrained language models: A survey

WX Zhao, J Liu, R Ren, JR Wen - ACM Transactions on Information …, 2024 - dl.acm.org
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …

Lexlip: Lexicon-bottlenecked language-image pre-training for large-scale image-text sparse retrieval

Z Luo, P Zhao, C Xu, X Geng, T Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Image-text retrieval (ITR) aims to retrieve images or texts that match a query originating from
the other modality. The conventional dense retrieval paradigm relies on encoding images …

Fine-grained distillation for long document retrieval

Y Zhou, T Shen, X Geng, C Tao, J Shen… - Proceedings of the …, 2024 - ojs.aaai.org
Long document retrieval aims to fetch query-relevant documents from a large-scale
collection, where knowledge distillation has become de facto to improve a retriever by …

Towards robust ranker for text retrieval

Y Zhou, T Shen, X Geng, C Tao, C Xu, G Long… - arXiv preprint arXiv …, 2022 - arxiv.org
A ranker plays an indispensable role in the de facto'retrieval & rerank'pipeline, but its
training still lags behind--learning from moderate negatives or/and serving as an auxiliary …

Towards Effective and Efficient Sparse Neural Information Retrieval

T Formal, C Lassance, B Piwowarski… - ACM Transactions on …, 2024 - dl.acm.org
Sparse representation learning based on Pre-trained Language Models has seen a growing
interest in Information Retrieval. Such approaches can take advantage of the proven …

Retrieval-based Disentangled Representation Learning with Natural Language Supervision

J Zhou, X Li, L Shang, X Jiang, Q Liu… - The Twelfth International …, 2024 - openreview.net
Disentangled representation learning remains challenging as the underlying factors of
variation in the data do not naturally exist. The inherent complexity of real-world data makes …

Towards a Unified Framework for Reference Retrieval and Related Work Generation

Z Shi, S Gao, Z Zhang, X Chen, Z Chen… - Findings of the …, 2023 - aclanthology.org
The task of related work generation aims to generate a comprehensive survey of related
research topics automatically, saving time and effort for authors. Existing methods simplify …

TriSampler: A Better Negative Sampling Principle for Dense Retrieval

Z Yang, Z Shao, Y Dong, J Tang - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Negative sampling stands as a pivotal technique in dense retrieval, essential for training
effective retrieval models and significantly impacting retrieval performance. While existing …

GLEN: Generative retrieval via lexical index learning

S Lee, M Choi, J Lee - arXiv preprint arXiv:2311.03057, 2023 - arxiv.org
Generative retrieval shed light on a new paradigm of document retrieval, aiming to directly
generate the identifier of a relevant document for a query. While it takes advantage of …