BEIR: A heterogenous benchmark for zero-shot evaluation of information retrieval models N Thakur, N Reimers, A Rücklé, A Srivastava, I Gurevych Proceedings of the Neural Information Processing Systems Track on Datasets …, 2021 | 730 | 2021 |
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks N Thakur, N Reimers, J Daxenberger, I Gurevych Proceedings of the 2021 Conference of the North American Chapter of the …, 2020 | 214 | 2020 |
GPL: Generative pseudo labeling for unsupervised domain adaptation of dense retrieval K Wang, N Thakur, N Reimers, I Gurevych arXiv preprint arXiv:2112.07577, 2021 | 137 | 2021 |
MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages X Zhang, N Thakur, O Ogundepo, E Kamalloo, D Alfonso-Hermelo, X Li, ... Transactions of the Association for Computational Linguistics 11, 1114-1131, 2023 | 70* | 2023 |
Hagrid: A human-llm collaborative dataset for generative information-seeking with attribution E Kamalloo, A Jafari, X Zhang, N Thakur, J Lin arXiv preprint arXiv:2307.16883, 2023 | 19 | 2023 |
Evaluating embedding APIs for information retrieval E Kamalloo, X Zhang, O Ogundepo, N Thakur, D Alfonso-Hermelo, ... arXiv preprint arXiv:2305.06300, 2023 | 16 | 2023 |
Injecting Domain Adaptation with Learning-to-hash for Effective and Efficient Zero-shot Dense Retrieval N Thakur, N Reimers, J Lin arXiv preprint arXiv:2205.11498, 2022 | 13* | 2022 |
NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation N Thakur, L Bonifacio, X Zhang, O Ogundepo, E Kamalloo, ... arXiv preprint arXiv:2312.11361, 2023 | 10 | 2023 |
Simple yet effective neural ranking and reranking baselines for cross-lingual information retrieval J Lin, D Alfonso-Hermelo, V Jeronymo, E Kamalloo, C Lassance, ... arXiv preprint arXiv:2304.01019, 2023 | 10 | 2023 |
Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses E Kamalloo, N Thakur, C Lassance, X Ma, JH Yang, J Lin Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024 | 7* | 2024 |
Leveraging llms for synthesizing training data across many languages in multilingual dense retrieval N Thakur, J Ni, GH Ábrego, J Wieting, J Lin, D Cer arXiv preprint arXiv:2311.05800, 2023 | 4 | 2023 |
SPRINT: A unified toolkit for evaluating and demystifying zero-shot neural sparse retrieval N Thakur, K Wang, I Gurevych, J Lin Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023 | 3 | 2023 |
Augmented SBERT: data augmentation method for improving bi-encoders for pairwise sentence scoring tasks 2020 N Thakur, N Reimers, J Daxenberger, I Gurevych arXiv preprint arXiv:2010.08240, 2021 | 3 | 2021 |
Systematic evaluation of neural retrieval models on the touché 2020 argument retrieval subset of beir N Thakur, L Bonifacio, M Fröbe, A Bondarenko, E Kamalloo, M Potthast, ... Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024 | 2 | 2024 |
Ragnar\" ok: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track R Pradeep, N Thakur, S Sharifymoghaddam, E Zhang, R Nguyen, ... arXiv preprint arXiv:2406.16828, 2024 | | 2024 |
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor S Upadhyay, R Pradeep, N Thakur, N Craswell, J Lin arXiv preprint arXiv:2406.06519, 2024 | | 2024 |
BWS Argument Similarity Corpus N Thakur, J Daxenberger, I Gurevych | | 2020 |