PARADE: Passage Representation Aggregation for Document Reranking
Pre-trained transformer models, such as BERT and T5, have been shown to be highly effective at
ad hoc passage and document ranking. Due to the inherent sequence length limits of these …
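A minimal sketch of the passage-aggregation idea named in this entry's title, assuming the document is split into overlapping passages, each passage is scored against the query by a neural cross-encoder, and the per-passage evidence is aggregated (max-pooling here) into a document score. The splitting parameters, the stand-in scorer, and all function names below are illustrative, not taken from the paper.

# Sketch of passage-level aggregation for document reranking. The scorer
# passed in is a stand-in for a BERT/T5 cross-encoder; the window size,
# stride, and max-pooling aggregator are illustrative choices.
from typing import Callable, List


def split_into_passages(doc: str, size: int = 225, stride: int = 200) -> List[str]:
    """Split a long document into overlapping windows of `size` words."""
    words = doc.split() or [doc]
    return [" ".join(words[i:i + size]) for i in range(0, len(words), stride)]


def rerank(query: str, docs: List[str],
           score_passage: Callable[[str, str], float]) -> List[str]:
    """Score every passage against the query; aggregate per document by max."""
    def doc_score(doc: str) -> float:
        return max(score_passage(query, p) for p in split_into_passages(doc))
    return sorted(docs, key=doc_score, reverse=True)


if __name__ == "__main__":
    # Toy lexical-overlap scorer standing in for a neural cross-encoder.
    def overlap(query: str, passage: str) -> float:
        q, p = set(query.lower().split()), set(passage.lower().split())
        return len(q & p) / (len(q) or 1)

    print(rerank("passage aggregation",
                 ["a long document about passage aggregation", "unrelated text"],
                 overlap))

Max-pooling is only one possible aggregator; the point of the sketch is that the document score is built from passage-level evidence rather than from a single truncated input.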
Local self-attention over long text for efficient document retrieval
Neural networks, particularly Transformer-based architectures, have achieved significant
performance improvements on several retrieval benchmarks. When the items being …
Modeling diverse relevance patterns in ad-hoc retrieval
Assessing relevance between a query and a document is challenging in ad-hoc retrieval
due to its diverse patterns, i.e., a document could be relevant to a query as a whole or …
Intra-document cascading: Learning to select passages for neural document ranking
An emerging recipe for achieving state-of-the-art effectiveness in neural document
reranking involves utilizing large pre-trained language models (e.g., BERT) to evaluate all …
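A small sketch of the cascading idea named in this entry's title: rather than running the expensive model over every passage, a cheap selector first keeps a small budget of passages, and only those are scored by the large model. The two scorers, the budget, and the max-aggregation below are illustrative assumptions, not the paper's exact models.

# Sketch of an intra-document cascade: a cheap first-stage selector picks a
# handful of passages and only those are scored by the expensive model.
from typing import Callable, List


def cascade_score(
    query: str,
    passages: List[str],
    cheap_score: Callable[[str, str], float],
    expensive_score: Callable[[str, str], float],
    budget: int = 4,
) -> float:
    """Keep the `budget` passages ranked highest by the cheap scorer, then
    return the best expensive-model score among the selected passages."""
    selected = sorted(passages, key=lambda p: cheap_score(query, p), reverse=True)[:budget]
    return max(expensive_score(query, p) for p in selected)

One natural design is to train the cheap selector to mimic the expensive model's preferences; the sketch only shows the control flow of the cascade.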
PageRank without hyperlinks: Structural reranking using links induced by language models
The ad hoc retrieval task is to find documents in a corpus that are relevant to a query.
Inspired by the PageRank and HITS (hubs and authorities) algorithms for Web search, we …
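A minimal sketch of the structural-reranking idea in this entry's title: induce links among an initially retrieved set of documents from their pairwise similarity, treat the weighted graph as a transition matrix, and rerank by PageRank-style scores. The bag-of-words cosine used to induce links is a stand-in for the language-model-based similarity the title refers to; the damping factor and iteration count are conventional defaults, not values from the paper.

# Sketch of structural reranking over an initially retrieved list: induce
# links from pairwise document similarity, then run PageRank on the graph.
from collections import Counter
from math import sqrt
from typing import List


def cosine(a: Counter, b: Counter) -> float:
    num = sum(a[t] * b[t] for t in a)
    den = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0


def pagerank_rerank(docs: List[str], damping: float = 0.85, iters: int = 50) -> List[str]:
    if not docs:
        return []
    vecs = [Counter(d.lower().split()) for d in docs]
    n = len(docs)
    # Pairwise similarities (zero diagonal) act as edge weights; each row is
    # normalized so it can be used as a transition distribution.
    sims = [[cosine(vecs[i], vecs[j]) if i != j else 0.0 for j in range(n)] for i in range(n)]
    rows = [sum(r) or 1.0 for r in sims]
    scores = [1.0 / n] * n
    for _ in range(iters):
        scores = [
            (1 - damping) / n + damping * sum(scores[i] * sims[i][j] / rows[i] for i in range(n))
            for j in range(n)
        ]
    return [d for _, d in sorted(zip(scores, docs), reverse=True)]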
Quality-biased ranking of web documents
Many existing retrieval approaches do not take into account the content quality of the
retrieved documents, although link-based measures such as PageRank are commonly used …
Unsupervised FAQ retrieval with question generation and BERT
We focus on the task of Frequently Asked Questions (FAQ) retrieval. A given user query can
be matched against the questions and/or the answers in the FAQ. We present a fully …
BLADE: combining vocabulary pruning and intermediate pretraining for scaleable neural CLIR
Learning sparse representations using pretrained language models enhances
monolingual ranking effectiveness. Such representations are sparse vectors in the …
Modeling higher-order term dependencies in information retrieval using query hypergraphs
M Bendersky, WB Croft - Proceedings of the 35th international ACM …, 2012 - dl.acm.org
Many of the recent, and more effective, retrieval models have incorporated dependencies
between the terms in the query. In this paper, we advance this query representation one step …
Finding text reuse on the web
M Bendersky, WB Croft - Proceedings of the Second ACM International …, 2009 - dl.acm.org
With the overwhelming number of reports on similar events originating from different sources
on the web, it is often hard, using existing web search paradigms, to find the original source …