HC4: A new suite of test collections for ad hoc CLIR

D Lawrie, J Mayfield, DW Oard, E Yang - European Conference on …, 2022 - Springer
HC4 is a new suite of test collections for ad hoc Cross-Language Information Retrieval
(CLIR), with Common Crawl News documents in Chinese, Persian, and Russian, topics in …

On the evaluation of machine-generated reports

J Mayfield, E Yang, D Lawrie, S MacAvaney… - Proceedings of the 47th …, 2024 - dl.acm.org
Large Language Models (LLMs) have enabled new ways to satisfy information needs.
Although great strides have been made in applying them to settings like document ranking …

HC3: A suite of test collections for CLIR evaluation over informal text

D Lawrie, J Mayfield, DW Oard, E Yang, S Nair… - Proceedings of the 46th …, 2023 - dl.acm.org
While there are many test collections for Cross-Language Information Retrieval (CLIR), none
of the large public test collections focus on short informal text documents. This paper …

Quantifying bias and variance of system rankings

GV Cormack, MR Grossman - … of the 42nd International ACM SIGIR …, 2019 - dl.acm.org
When used to assess the accuracy of system rankings, Kendall's tau and other rank
correlation measures conflate bias and variance as sources of error. We derive from tau a …

Continuous Evaluation in Information Retrieval

J Keller - 2023 - publiscologne.th-koeln.de
As the information era progresses, the sheer volume of information calls for sophisticated
retrieval systems. Evaluating them holds the key to ensuring the reliability and relevance of …

[PDF][PDF] HC4: A New Suite of Test Collections for Ad Hoc CLIR

E Yang - arXiv preprint arXiv:2201.09992, 2022 - academia.edu
HC4 is a new suite of test collections for ad hoc Cross-Language Information Retrieval
(CLIR), with Common Crawl News documents in Chinese, Persian, and Russian, topics in …

[PDF][PDF] HC4: A New Suite of Test Collections for Ad Hoc CLIR

DW Oard, E Yang - user.eng.umd.edu
HC4 is a new suite of test collections for ad hoc Cross-Language Information Retrieval
(CLIR), with Common Crawl News documents in Chinese, Persian, and Russian, topics in …

Test collections for web-scale datasets using Dynamic Sampling

A Singh - 2022 - uwspace.uwaterloo.ca
Dynamic Sampling is a non-uniform statistical sampling strategy based on S-CAL, a high-
recall retrieval algorithm. It is used for the construction of statistical test collections for …