LLMs Can Patch Up Missing Relevance Judgments in Evaluation | S Upadhyay, E Kamalloo, J Lin | arXiv preprint arXiv:2405.04727, 2024 | 7 | 2024 |
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor | S Upadhyay, R Pradeep, N Thakur, N Craswell, J Lin | arXiv preprint arXiv:2406.06519, 2024 | 4 | 2024 |
Regenerating vital facial keypoints for impostor identification from disguised images using CNN | J Mehta, S Talati, S Upadhyay, S Valiveti, G Raval | Expert Systems with Applications 219, 119669, 2023 | 4 | 2023 |
Towards Robust QA Evaluation via Open LLMs | E Kamalloo, S Upadhyay, J Lin | Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024 | 3 | 2024 |
UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models | S Sharifymoghaddam, S Upadhyay, W Chen, J Lin | arXiv preprint arXiv:2405.10311, 2024 | 3 | 2024 |
A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look | S Upadhyay, R Pradeep, N Thakur, D Campos, N Craswell, I Soboroff, ... | arXiv preprint arXiv:2411.08275, 2024 | 2 | 2024 |
Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework | R Pradeep, N Thakur, S Upadhyay, D Campos, N Craswell, J Lin | arXiv preprint arXiv:2411.09607, 2024 | | 2024 |