Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text
S Gehrmann, E Clark, T Sellam - Journal of Artificial Intelligence Research, 2023 - jair.org
Abstract Evaluation practices in natural language generation (NLG) have many known flaws,
but improved evaluation approaches are rarely widely adopted. This issue has become …
but improved evaluation approaches are rarely widely adopted. This issue has become …
XL-sum: Large-scale multilingual abstractive summarization for 44 languages
Contemporary works on abstractive text summarization have focused primarily on high-
resource languages like English, mostly due to the limited availability of datasets for low/mid …
resource languages like English, mostly due to the limited availability of datasets for low/mid …
WikiLingua: A new benchmark dataset for cross-lingual abstractive summarization
We introduce WikiLingua, a large-scale, multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. We extract article and summary pairs in 18 …
crosslingual abstractive summarization systems. We extract article and summary pairs in 18 …
A survey on cross-lingual summarization
Cross-lingual summarization is the task of generating a summary in one language (eg,
English) for the given document (s) in a different language (eg, Chinese). Under the …
English) for the given document (s) in a different language (eg, Chinese). Under the …
MLSUM: The multilingual summarization corpus
We present MLSUM, the first large-scale MultiLingual SUMmarization dataset. Obtained
from online newspapers, it contains 1.5 M+ article/summary pairs in five different languages …
from online newspapers, it contains 1.5 M+ article/summary pairs in five different languages …
Centroid-based text summarization through compositionality of word embeddings
The textual similarity is a crucial aspect for many extractive text summarization methods. A
bag-of-words representation does not allow to grasp the semantic relationships between …
bag-of-words representation does not allow to grasp the semantic relationships between …
[图书][B] Quality estimation for machine translation
Many applications within natural language processing involve performing text-to-text
transformations, ie, given a text in natural language as input, systems are required to …
transformations, ie, given a text in natural language as input, systems are required to …
Revisiting non-English text simplification: A unified multilingual benchmark
Recent advancements in high-quality, large-scale English resources have pushed the
frontier of English Automatic Text Simplification (ATS) research. However, less work has …
frontier of English Automatic Text Simplification (ATS) research. However, less work has …
Towards unifying multi-lingual and cross-lingual summarization
To adapt text summarization to the multilingual world, previous work proposes multi-lingual
summarization (MLS) and cross-lingual summarization (CLS). However, these two tasks …
summarization (MLS) and cross-lingual summarization (CLS). However, these two tasks …
Summary-oriented vision modeling for multimodal abstractive summarization
Multimodal abstractive summarization (MAS) aims to produce a concise summary given the
multimodal data (text and vision). Existing studies mainly focus on how to effectively use the …
multimodal data (text and vision). Existing studies mainly focus on how to effectively use the …