Unbiased Learning to Rank Meets Reality: Lessons from Baidu's Large-Scale Search Dataset

P Hager, R Deffayet, JM Renders, O Zoeter… - Proceedings of the 47th …, 2024 - dl.acm.org
Unbiased learning-to-rank (ULTR) is a well-established framework for learning from user
clicks, which are often biased by the ranker collecting the data. While theoretically justified …

Pre-trained language model-based retrieval and ranking for web search

L Zou, W Lu, Y Liu, H Cai, X Chu, D Ma, D Shi… - ACM Transactions on …, 2022 - dl.acm.org
Pre-trained language representation models (PLMs) such as BERT and Enhanced
Representation through kNowledge IntEgration (ERNIE) have been integral to achieving …

Model-based unbiased learning to rank

D Luo, L Zou, Q Ai, Z Chen, D Yin… - Proceedings of the …, 2023 - dl.acm.org
Unbiased Learning to Rank (ULTR), ie, learning to rank documents with biased user
feedback data, is a well-known challenge in information retrieval. Existing methods in …

S2phere: Semi-Supervised Pre-training for Web Search over Heterogeneous Learning to Rank Data

Y Li, H Xiong, L Kong, Q Wang, S Wang… - Proceedings of the 29th …, 2023 - dl.acm.org
While Learning to Rank (LTR) models on top of transformers have been widely adopted to
achieve decent performance, it is still challenging to train the model with sufficient data as …

Unbiased Learning-to-Rank Needs Unconfounded Propensity Estimation

D Luo, L Zou, Q Ai, Z Chen, C Li, D Yin… - Proceedings of the 47th …, 2024 - dl.acm.org
The logs of the use of a search engine provide sufficient data to train a better ranker.
However, it is well known that such implicit feedback reflects biases, and in particular a …

Towards better web search performance: pre-training, fine-tuning and learning to rank

H Li, J Chen, W Su, Q Ai, Y Liu - arXiv preprint arXiv:2303.04710, 2023 - arxiv.org
This paper describes the approach of the THUIR team at the WSDM Cup 2023 Pre-training
for Web Search task. This task requires the participant to rank the relevant documents for …

Feature-Enhanced Network with Hybrid Debiasing Strategies for Unbiased Learning to Rank

L Yu, Y Wang, X Sun, K Bi, J Guo - arXiv preprint arXiv:2302.07530, 2023 - arxiv.org
Unbiased learning to rank (ULTR) aims to mitigate various biases existing in user clicks,
such as position bias, trust bias, presentation bias, and learn an effective ranker. In this …

CIR at the NTCIR-17 ULTRE-2 Task

L Yu, K Bi, J Guo, X Cheng - arXiv preprint arXiv:2310.11852, 2023 - arxiv.org
The Chinese academy of sciences Information Retrieval team (CIR) has participated in the
NTCIR-17 ULTRE-2 task. This paper describes our approaches and reports our results on …

CWRCzech: 100M Query-Document Czech Click Dataset and Its Application to Web Relevance Ranking

J Vonásek, M Straka, R Krč, L Lasonová… - Proceedings of the 47th …, 2024 - dl.acm.org
We present CWRCzech, Click Web Ranking dataset for Czech, a 100M query-document
Czech click dataset for relevance ranking with user behavior data collected from search …

Multi-feature integration for perception-dependent examination-bias estimation

X Chen, X Li, K Wei, B Hu, L Jiang, Z Huang… - arXiv preprint arXiv …, 2023 - arxiv.org
Eliminating examination bias accurately is pivotal to apply click-through data to train an
unbiased ranking model. However, most examination-bias estimators are limited to the …