Which Diversity Evaluation Measures Are "Good"?
This study evaluates 30 IR evaluation measures or their instances, of which nine are for
adhoc IR and 21 are for diversified IR, primarily from the viewpoint of whether their …
Offline evaluation by maximum similarity to an ideal ranking
CLA Clarke, MD Smucker, A Vtyurina - Proceedings of the 29th ACM …, 2020 - dl.acm.org
NDCG and similar measures remain standard for the offline evaluation of search,
recommendation, question answering and similar systems. These measures require …
Preference-based offline evaluation
A core step in production model research and development involves the offline evaluation of
a system before production deployment. Traditional offline evaluation of search …
[BOOK][B] Task intelligence for search and recommendation
While great strides have been made in the field of search and recommendation, there are
still challenges and opportunities to address information access issues that involve solving …
Ranking Interruptus: When Truncated Rankings Are Better and How to Measure That
Most of information retrieval effectiveness evaluation metrics assume that systems
appending irrelevant documents at the bottom of the ranking are as effective as (or not …
Principled multi-aspect evaluation measures of rankings
Information Retrieval evaluation has traditionally focused on defining principled ways of
assessing the relevance of a ranked list of documents with respect to a query. Several …
Component-based Analysis of Dynamic Search Performance
In many search scenarios, such as exploratory, comparative, or survey-oriented search,
users interact with dynamic search systems to satisfy multi-aspect information needs. These …
Dynamic diversification for interactive complex search
A Albahem - Advances in Information Retrieval: 41st European …, 2019 - Springer
Many real-world searches are examples of complex information needs such as exploratory,
comparative or survey oriented searches. In these search scenarios, users engage …
[PDF][PDF] 10. Test collection incompleteness and un-judged documents
1 INTRODUCTION IR test collections are notoriously incomplete: not all documents are
assessed, and many more documents are assessed as non-relevant than as relevant. This …