A survey on recent advances in keyphrase extraction from pre-trained language models

M Song, Y Feng, L Jing - Findings of the Association for …, 2023 - aclanthology.org
Keyphrase Extraction (KE) is a critical component in Natural Language Processing (NLP)
systems for selecting a set of phrases from the document that could summarize the important …

ETC: Encoding long and structured inputs in transformers

J Ainslie, S Ontanon, C Alberti, V Cvicek… - arXiv preprint arXiv …, 2020 - arxiv.org
Transformer models have advanced the state of the art in many Natural Language
Processing (NLP) tasks. In this paper, we present a new Transformer architecture, Extended …

From statistical methods to deep learning, automatic keyphrase prediction: A survey

B Xie, J Song, L Shao, S Wu, X Wei, B Yang… - Information Processing …, 2023 - Elsevier
Keyphrase prediction aims to generate phrases (keyphrases) that highly summarizes a
given document. Recently, researchers have conducted in-depth studies on this task from …

Learning-to-Rank with BERT in TF-Ranking

S Han, X Wang, M Bendersky, M Najork - arXiv preprint arXiv:2004.08476, 2020 - arxiv.org
This paper describes a machine learning algorithm for document (re) ranking, in which
queries and documents are firstly encoded using BERT [1], and on top of that a learning-to …

A new direction in stance detection: Target-stance extraction in the wild

Y Li, K Garg, C Caragea - Proceedings of the 61st Annual Meeting …, 2023 - aclanthology.org
Stance detection aims to detect the stance toward a corresponding target. Existing works
use the assumption that the target is known in advance, which is often not the case in the …

ERNIE-Doc: A retrospective long-document modeling transformer

S Ding, J Shang, S Wang, Y Sun, H Tian, H Wu… - arXiv preprint arXiv …, 2020 - arxiv.org
Transformers are not suited for processing long documents, due to their quadratically
increasing memory and time consumption. Simply truncating a long document or applying …

ClueWeb22: 10 billion web documents with rich information

A Overwijk, C Xiong, J Callan - … of the 45th international ACM SIGIR …, 2022 - dl.acm.org
ClueWeb22, the newest iteration of the ClueWeb line of datasets, is the result of more than a
year of collaboration between industry and academia. Its design is influenced by the …

[PDF][PDF] A Systematic Literature Review of Keyphrases Extraction Approaches.

L Ajallouda, FZ Fagroud, A Zellou… - Int. J. Interact. Mob …, 2022 - academia.edu
The keyphrases of a document are the textual units that characterize its content such as the
topics it addresses, its ideas, their field, etc. Thousands of books, articles and web pages are …

Is chatgpt a good keyphrase generator? a preliminary study

M Song, H Jiang, S Shi, S Yao, S Lu, Y Feng… - arXiv preprint arXiv …, 2023 - arxiv.org
The emergence of ChatGPT has recently garnered significant attention from the
computational linguistics community. To demonstrate its capabilities as a keyphrase …

Information extraction from text intensive and visually rich banking documents

B Oral, E Emekligil, S Arslan, G Eryiǧit - Information Processing & …, 2020 - Elsevier
Document types, where visual and textual information plays an important role in their
analysis and understanding, pose a new and attractive area for information extraction …