Cross-language information retrieval

P Galuščáková, DW Oard, S Nair - arXiv preprint arXiv:2111.05988, 2021 - arxiv.org
Two key assumptions shape the usual view of ranked retrieval: (1) that the searcher can
choose words for their query that might appear in the documents that they wish to see, and …

CLIRMatrix: A massively large collection of bilingual and multilingual datasets for Cross-Lingual Information Retrieval

S Sun, K Duh - Proceedings of the 2020 Conference on Empirical …, 2020 - aclanthology.org
We present CLIRMatrix, a massively large collection of bilingual and multilingual datasets
for Cross-Lingual Information Retrieval extracted automatically from Wikipedia. CLIRMatrix …

HC4: A new suite of test collections for ad hoc CLIR

D Lawrie, J Mayfield, DW Oard, E Yang - European Conference on …, 2022 - Springer
HC4 is a new suite of test collections for ad hoc Cross-Language Information Retrieval
(CLIR), with Common Crawl News documents in Chinese, Persian, and Russian, topics in …

Soft Prompt Decoding for Multilingual Dense Retrieval

Z Huang, H Zeng, H Zamani, J Allan - Proceedings of the 46th …, 2023 - dl.acm.org
In this work, we explore a Multilingual Information Retrieval (MLIR) task, where the collection
includes documents in multiple languages. We demonstrate that applying state-of-the-art …

Mixed attention transformer for leveraging word-level knowledge to neural cross-lingual information retrieval

Z Huang, H Bonab, SM Sarwar, R Rahimi… - Proceedings of the 30th …, 2021 - dl.acm.org
Pre-trained contextualized representations offer great success for many downstream tasks,
including document ranking. The multilingual versions of such pre-trained representations …

Combining contextualized and non-contextualized query translations to improve CLIR

S Nair, P Galuscakova, DW Oard - … of the 43rd International ACM SIGIR …, 2020 - dl.acm.org
In cross-language information retrieval using probabilistic structured queries (PSQ),
translation probabilities from statistical machine translation act as a bridge between the …

Constraint translation candidates: A bridge between neural query translation and cross-lingual information retrieval

T Bi, L Yao, B Yang, H Zhang, W Luo… - arXiv preprint arXiv …, 2020 - arxiv.org
Query translation (QT) is a key component in cross-lingual information retrieval system
(CLIR). With the help of deep learning, neural machine translation (NMT) has shown …

Reliable confidence intervals for information retrieval evaluation using generative AI

H Oosterhuis, R Jagerman, Z Qin, X Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
The traditional evaluation of information retrieval (IR) systems is generally very costly as it
requires manual relevance annotation from human experts. Recent advancements in …

NCC: Neural concept compression for multilingual document recommendation

TM Tashu, M Lenz, T Horváth - Applied Soft Computing, 2023 - Elsevier
In this work, we propose a novel method for generating inter-lingual document
representations using neural network concept compression. The presented approach is …

Domain transfer based data augmentation for neural query translation

L Yao, B Yang, H Zhang, B Chen… - Proceedings of the 28th …, 2020 - aclanthology.org
Query translation (QT) serves as a critical factor in successful cross-lingual information
retrieval (CLIR). Due to the lack of parallel query samples, neural-based QT models are …