A survey on recent advances in keyphrase extraction from pre-trained language models
Keyphrase Extraction (KE) is a critical component in Natural Language Processing (NLP)
systems for selecting a set of phrases from the document that could summarize the important …
ETC: Encoding long and structured inputs in transformers
Transformer models have advanced the state of the art in many Natural Language
Processing (NLP) tasks. In this paper, we present a new Transformer architecture, Extended …
From statistical methods to deep learning, automatic keyphrase prediction: A survey
Keyphrase prediction aims to generate phrases (keyphrases) that highly summarize a
given document. Recently, researchers have conducted in-depth studies on this task from …
Learning-to-Rank with BERT in TF-Ranking
This paper describes a machine learning algorithm for document (re)ranking, in which
queries and documents are firstly encoded using BERT [1], and on top of that a learning-to …
A new direction in stance detection: Target-stance extraction in the wild
Stance detection aims to detect the stance toward a corresponding target. Existing works
use the assumption that the target is known in advance, which is often not the case in the …
ERNIE-Doc: A retrospective long-document modeling transformer
Transformers are not suited for processing long documents, due to their quadratically
increasing memory and time consumption. Simply truncating a long document or applying …
ClueWeb22: 10 billion web documents with rich information
A Overwijk, C Xiong, J Callan - … of the 45th international ACM SIGIR …, 2022 - dl.acm.org
ClueWeb22, the newest iteration of the ClueWeb line of datasets, is the result of more than a
year of collaboration between industry and academia. Its design is influenced by the …
A Systematic Literature Review of Keyphrases Extraction Approaches.
The keyphrases of a document are the textual units that characterize its content such as the
topics it addresses, its ideas, their field, etc. Thousands of books, articles and web pages are …
Is ChatGPT a good keyphrase generator? A preliminary study
The emergence of ChatGPT has recently garnered significant attention from the
computational linguistics community. To demonstrate its capabilities as a keyphrase …
Information extraction from text intensive and visually rich banking documents
Document types, where visual and textual information plays an important role in their
analysis and understanding, pose a new and attractive area for information extraction …