Bootstrapped nDCG Estimation in the Presence of Unjudged Documents

S MacAvaney, L Soldaini - Proceedings of the 46th International ACM …, 2023 - dl.acm.org

Dealing with unjudged documents ("holes") in relevance assessments is a perennial
problem when evaluating search systems with offline experiments. Holes can reduce the …

Cited by 47 · Related articles · All 4 versions

Evaluating generative ad hoc information retrieval

L Gienapp, H Scells, N Deckers, J Bevendorff… - Proceedings of the 47th …, 2024 - dl.acm.org

Recent advances in large language models have enabled the development of viable
generative retrieval systems. Instead of a traditional document ranking, generative retrieval …

Cited by 14 · Related articles · All 4 versions

Perspectives on large language models for relevance judgment

G Faggioli, L Dietz, CLA Clarke, G Demartini… - Proceedings of the …, 2023 - dl.acm.org

When asked, large language models (LLMs) like ChatGPT claim that they can assist with
relevance judgments but it is not clear whether automated judgments can reliably be used in …

Cited by 18 · Related articles · All 10 versions

Query performance prediction using relevance judgments generated by large language models

C Meng, N Arabzadeh, A Askari, M Aliannejadi… - arXiv preprint arXiv …, 2024 - arxiv.org

Query performance prediction (QPP) aims to estimate the retrieval quality of a search system
for a query without human relevance judgments. Previous QPP methods typically return a …

Cited by 13 · Related articles · All 3 versions

LLMs Can Patch Up Missing Relevance Judgments in Evaluation

S Upadhyay, E Kamalloo, J Lin - arXiv preprint arXiv:2405.04727, 2024 - arxiv.org

Unjudged documents or holes in information retrieval benchmarks are considered
non-relevant in evaluation, yielding no gains in measuring effectiveness. However, these missing …

Cited by 10 · Related articles · All 2 versions

[PDF] Team OpenWebSearch at CLEF 2024: QuantumCLEF

M Fröbe, D Alexander, GAW Hendriksen, F Schlatt… - 2024 - repository.ubn.ru.nl

We describe the OpenWebSearch group's participation in the CLEF 2024 QuantumCLEF IR
Feature Selection track. Our submitted runs focus on the observation that the importance of …

Cited by 2 · Related articles · All 3 versions

[PDF] Team OpenWebSearch at CLEF 2024: LongEval

D Alexander, M Fröbe, GAW Hendriksen, F Schlatt… - 2024 - repository.ubn.ru.nl

We describe the OpenWebSearch group's participation in the CLEF 2024 LongEval IR track.
Our submitted runs explore how historical data from the past can be transferred into future …

Cited by 1 · Related articles · All 4 versions

[PDF] Open Web Search at LongEval 2023: Reciprocal Rank Fusion on Automatically Generated Query Variants

M Fröbe, G Hendriksen, AP de Vries, M Potthast - 2023 - repository.ubn.ru.nl

We describe the participation of the Open Web Search (OWS) team in the shared task
LongEval hosted at CLEF 2023. Our submission is motivated by previous observations on …

Cited by 1 · Related articles · All 3 versions