Generator-retriever-generator: A novel approach to open-domain question answering

A Abdallah, A Jatowt - arXiv preprint arXiv:2307.11278, 2023 - arxiv.org
Open-domain question answering (QA) tasks usually require the retrieval of relevant
information from a large corpus to generate accurate answers. We propose a novel …
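
The snippet alludes to the standard retrieve-then-generate pattern that such pipelines build on. Below is a toy, self-contained sketch of that generic pattern only, not the paper's generator-retriever-generator architecture; the corpus, retrieve, and generate_answer stand-ins are invented for illustration:

```python
# Toy retrieve-then-generate pipeline: score passages by term overlap,
# then hand the top passages to an answer generator. Illustrative only;
# real systems use neural retrievers and a large language model.
corpus = {
    "d1": "Innsbruck is a city in the Austrian state of Tyrol.",
    "d2": "BM25 is a lexical ranking function used in search engines.",
}

def retrieve(query: str, k: int = 1) -> list[str]:
    """Rank passages by simple term overlap with the query."""
    q_terms = set(query.lower().split())
    scored = sorted(
        corpus.items(),
        key=lambda kv: len(q_terms & set(kv[1].lower().split())),
        reverse=True,
    )
    return [doc_id for doc_id, _ in scored[:k]]

def generate_answer(query: str, passages: list[str]) -> str:
    """Stand-in for a seq2seq generator conditioned on retrieved text."""
    return f"Answer to {query!r} based on: " + " ".join(passages)

top = retrieve("Which state is Innsbruck in?")
print(generate_answer("Which state is Innsbruck in?", [corpus[d] for d in top]))
```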

SPRINT: A unified toolkit for evaluating and demystifying zero-shot neural sparse retrieval

N Thakur, K Wang, I Gurevych, J Lin - Proceedings of the 46th …, 2023 - dl.acm.org
Traditionally, sparse retrieval systems relied on lexical representations such as BM25 to
retrieve documents, and these dominated information retrieval tasks. With the onset of pre …
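
The BM25 function named in the snippet is compact enough to state directly. A minimal sketch with the standard k1/b parameterization, over a toy tokenized corpus (real systems add analyzers, stopwording, and inverted indexes):

```python
import math

# Minimal BM25 over a pre-tokenized corpus; k1 and b are the usual
# free parameters controlling term-frequency saturation and length
# normalization.
docs = [["sparse", "retrieval", "with", "bm25"],
        ["dense", "retrieval", "with", "transformers"]]
N = len(docs)
avgdl = sum(len(d) for d in docs) / N

def idf(term: str) -> float:
    df = sum(term in d for d in docs)
    return math.log(1 + (N - df + 0.5) / (df + 0.5))

def bm25(query: list[str], doc: list[str], k1: float = 1.2, b: float = 0.75) -> float:
    score = 0.0
    for t in query:
        tf = doc.count(t)
        if tf:
            norm = k1 * (1 - b + b * len(doc) / avgdl)
            score += idf(t) * tf * (k1 + 1) / (tf + norm)
    return score

print(bm25(["sparse", "retrieval"], docs[0]))
```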

Generative retrieval as multi-vector dense retrieval

S Wu, W Wei, M Zhang, Z Chen, J Ma, Z Ren… - Proceedings of the 47th …, 2024 - dl.acm.org
For a given query, generative retrieval generates identifiers of relevant documents in an
end-to-end manner using a sequence-to-sequence architecture. The relation between …
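
Identifier generation of this kind is commonly implemented with decoding constrained to valid docids, for example via a trie over identifier strings. A toy sketch of that constraint, with a deterministic stand-in for the decoder's token scores (the docids and scoring function are invented):

```python
# Greedy decoding constrained to a trie of valid docids: the device
# that lets a seq2seq model emit only real document identifiers.
docids = ["d-12", "d-17", "d-42"]

def build_trie(strings):
    trie = {}
    for s in strings:
        node = trie
        for ch in s + "$":          # "$" marks end-of-identifier
            node = node.setdefault(ch, {})
    return trie

def dummy_logit(prefix: str, ch: str) -> float:
    """Deterministic stand-in for the decoder's next-token score."""
    return sum(ord(c) for c in prefix + ch) % 97 / 97.0

def generate_docid(trie) -> str:
    prefix, node = "", trie
    while True:
        if "$" in node and len(node) == 1:
            return prefix           # reached a complete identifier
        # Restrict the decoder's choices to children that stay on the trie.
        choices = [ch for ch in node if ch != "$"]
        best = max(choices, key=lambda ch: dummy_logit(prefix, ch))
        prefix, node = prefix + best, node[best]

print(generate_docid(build_trie(docids)))
```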

Distillation for Multilingual Information Retrieval

E Yang, D Lawrie, J Mayfield - Proceedings of the 47th International ACM …, 2024 - dl.acm.org
Recent work in cross-language information retrieval (CLIR), where queries and documents
are in different languages, has shown the benefit of the Translate-Distill framework that …

Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses

E Kamalloo, N Thakur, C Lassance, X Ma… - Proceedings of the 47th …, 2024 - dl.acm.org
BEIR is a benchmark dataset originally designed for zero-shot evaluation of retrieval models
across 18 different domain/task combinations. In recent years, we have witnessed the …
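
For readers who want to reproduce a zero-shot BEIR run, the open-source beir package exposes a short evaluation loop. A sketch assuming that package's quickstart API; the dataset and model names are examples only:

```python
# Zero-shot evaluation on one BEIR dataset with the `beir` package
# (pip install beir); dataset/model choices here are just examples.
from beir import util
from beir.datasets.data_loader import GenericDataLoader
from beir.retrieval import models
from beir.retrieval.evaluation import EvaluateRetrieval
from beir.retrieval.search.dense import DenseRetrievalExactSearch as DRES

dataset = "scifact"
url = f"https://public.ukp.informatik.tu-darmstadt.de/thakur/BEIR/datasets/{dataset}.zip"
data_path = util.download_and_unzip(url, "datasets")
corpus, queries, qrels = GenericDataLoader(data_folder=data_path).load(split="test")

model = DRES(models.SentenceBERT("msmarco-distilbert-base-tas-b"), batch_size=64)
retriever = EvaluateRetrieval(model, score_function="dot")
results = retriever.retrieve(corpus, queries)
ndcg, _map, recall, precision = retriever.evaluate(qrels, results, retriever.k_values)
print(ndcg)
```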

Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard

E Kamalloo, N Thakur, C Lassance, X Ma… - arXiv preprint arXiv …, 2023 - arxiv.org
BEIR is a benchmark dataset for zero-shot evaluation of information retrieval models across
18 different domain/task combinations. In recent years, we have witnessed the growing …

Balanced Knowledge Distillation with Contrastive Learning for Document Re-ranking

Y Yang, S He, Y Qiao, W Xie, T Yang - Proceedings of the 2023 ACM …, 2023 - dl.acm.org
Knowledge distillation is commonly used in training a neural document ranking model by
employing a teacher to guide model refinement. As a teacher may not be correct in all cases …
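
Teacher-guided refinement of this kind typically sums a contrastive term over a query's candidate list with a divergence to the teacher's scores. A PyTorch-style sketch under that assumption; the plain weight lam stands in for whatever balancing scheme the paper proposes:

```python
import torch
import torch.nn.functional as F

def distill_contrastive_loss(student_scores, teacher_scores, pos_idx, lam=0.5):
    """student_scores/teacher_scores: (batch, n_candidates) ranking scores;
    pos_idx: index of the relevant document among each query's candidates."""
    # Contrastive (InfoNCE-style) term: positive vs. in-list negatives.
    contrastive = F.cross_entropy(student_scores, pos_idx)
    # Distillation term: match the teacher's score distribution via KL.
    kl = F.kl_div(
        F.log_softmax(student_scores, dim=-1),
        F.softmax(teacher_scores, dim=-1),
        reduction="batchmean",
    )
    return contrastive + lam * kl

s = torch.randn(2, 8, requires_grad=True)
t = torch.randn(2, 8)
loss = distill_contrastive_loss(s, t, torch.tensor([0, 3]))
loss.backward()
print(float(loss))
```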

SPLATE: Sparse late interaction retrieval

T Formal, S Clinchant, H Déjean… - Proceedings of the 47th …, 2024 - dl.acm.org
The late interaction paradigm introduced with ColBERT stands out in the neural Information
Retrieval space, offering a compelling effectiveness-efficiency trade-off across many …
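
Late interaction scores a document by matching each query token embedding against its best document token embedding and summing the maxima (ColBERT's MaxSim operator). A NumPy sketch with random stand-ins for the token embeddings:

```python
import numpy as np

def late_interaction_score(q_emb: np.ndarray, d_emb: np.ndarray) -> float:
    """ColBERT-style MaxSim: q_emb is (n_q, dim), d_emb is (n_d, dim),
    both assumed L2-normalized so dot products are cosine similarities."""
    sim = q_emb @ d_emb.T                 # (n_q, n_d) token-token similarities
    return float(sim.max(axis=1).sum())   # best doc token per query token

rng = np.random.default_rng(0)
q = rng.normal(size=(4, 128)); q /= np.linalg.norm(q, axis=1, keepdims=True)
d = rng.normal(size=(50, 128)); d /= np.linalg.norm(d, axis=1, keepdims=True)
print(late_interaction_score(q, d))
```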

End-to-End Retrieval with Learned Dense and Sparse Representations Using Lucene

H Chen, C Lassance, J Lin - arXiv preprint arXiv:2311.18503, 2023 - arxiv.org
The bi-encoder architecture provides a framework for understanding machine-learned
retrieval models based on dense and sparse vector representations. Although these …
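
The unifying observation behind the bi-encoder framing is that dense and sparse models both score by an inner product between query and document representations; only the vector type differs. A minimal sketch of that shared abstraction, with hard-coded stand-ins for encoder outputs:

```python
import numpy as np

# Bi-encoder scoring is an inner product in both regimes; the encoder
# decides whether vectors are dense arrays or sparse term->weight maps.
def dense_score(q: np.ndarray, d: np.ndarray) -> float:
    return float(q @ d)

def sparse_score(q: dict[str, float], d: dict[str, float]) -> float:
    return sum(w * d[t] for t, w in q.items() if t in d)

q_dense, d_dense = np.array([0.1, 0.9, 0.3]), np.array([0.2, 0.8, 0.5])
q_sparse = {"neural": 1.2, "retrieval": 0.7}
d_sparse = {"retrieval": 0.9, "lucene": 0.4}

print(dense_score(q_dense, d_dense))      # dense inner product
print(sparse_score(q_sparse, d_sparse))   # inner product over shared terms
```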

Weighted KL-Divergence for Document Ranking Model Refinement

Y Yang, Y Qiao, S He, T Yang - … of the 47th International ACM SIGIR …, 2024 - dl.acm.org
Transformer-based retrieval and reranking models for text document search are often
refined through knowledge distillation together with contrastive learning. A tight distribution …
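
One way to tighten the distribution match the snippet mentions is to reweight the per-candidate KL terms, for example toward the teacher's top-ranked documents. A sketch under that assumption; the weighting scheme below is illustrative, not the paper's formulation:

```python
import torch
import torch.nn.functional as F

def weighted_kl(student_scores, teacher_scores, weights):
    """Per-candidate weighted KL(teacher || student) over ranking scores.
    All tensors are (batch, n_candidates); weights lets some documents
    contribute more to the refinement signal than others."""
    p = F.softmax(teacher_scores, dim=-1)
    log_q = F.log_softmax(student_scores, dim=-1)
    kl_terms = p * (p.log() - log_q)          # elementwise KL contributions
    return (weights * kl_terms).sum(dim=-1).mean()

t = torch.randn(2, 8)
s = torch.randn(2, 8, requires_grad=True)
# Illustrative weights: emphasize the teacher's top-ranked candidates.
ranks = t.argsort(dim=-1, descending=True).argsort(dim=-1)
w = 1.0 / (ranks.float() + 1.0)
loss = weighted_kl(s, t, w)
loss.backward()
print(float(loss))
```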