Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text

S Gehrmann, E Clark, T Sellam - Journal of Artificial Intelligence Research, 2023 - jair.org
Abstract Evaluation practices in natural language generation (NLG) have many known flaws,
but improved evaluation approaches are rarely widely adopted. This issue has become …

XL-sum: Large-scale multilingual abstractive summarization for 44 languages

T Hasan, A Bhattacharjee, MS Islam, K Samin… - arXiv preprint arXiv …, 2021 - arxiv.org
Contemporary works on abstractive text summarization have focused primarily on high-
resource languages like English, mostly due to the limited availability of datasets for low/mid …

WikiLingua: A new benchmark dataset for cross-lingual abstractive summarization

F Ladhak, E Durmus, C Cardie, K McKeown - arXiv preprint arXiv …, 2020 - arxiv.org
We introduce WikiLingua, a large-scale, multilingual dataset for the evaluation of
crosslingual abstractive summarization systems. We extract article and summary pairs in 18 …

A survey on cross-lingual summarization

J Wang, F Meng, D Zheng, Y Liang, Z Li… - Transactions of the …, 2022 - direct.mit.edu
Cross-lingual summarization is the task of generating a summary in one language (eg,
English) for the given document (s) in a different language (eg, Chinese). Under the …

MLSUM: The multilingual summarization corpus

T Scialom, PA Dray, S Lamprier, B Piwowarski… - arXiv preprint arXiv …, 2020 - arxiv.org
We present MLSUM, the first large-scale MultiLingual SUMmarization dataset. Obtained
from online newspapers, it contains 1.5 M+ article/summary pairs in five different languages …

Centroid-based text summarization through compositionality of word embeddings

G Rossiello, P Basile, G Semeraro - Proceedings of the multiling …, 2017 - aclanthology.org
The textual similarity is a crucial aspect for many extractive text summarization methods. A
bag-of-words representation does not allow to grasp the semantic relationships between …

[图书][B] Quality estimation for machine translation

L Specia, C Scarton, GH Paetzold - 2022 - books.google.com
Many applications within natural language processing involve performing text-to-text
transformations, ie, given a text in natural language as input, systems are required to …

Revisiting non-English text simplification: A unified multilingual benchmark

MJ Ryan, T Naous, W Xu - arXiv preprint arXiv:2305.15678, 2023 - arxiv.org
Recent advancements in high-quality, large-scale English resources have pushed the
frontier of English Automatic Text Simplification (ATS) research. However, less work has …

Towards unifying multi-lingual and cross-lingual summarization

J Wang, F Meng, D Zheng, Y Liang, Z Li, J Qu… - arXiv preprint arXiv …, 2023 - arxiv.org
To adapt text summarization to the multilingual world, previous work proposes multi-lingual
summarization (MLS) and cross-lingual summarization (CLS). However, these two tasks …

Summary-oriented vision modeling for multimodal abstractive summarization

Y Liang, F Meng, J Xu, J Wang, Y Chen… - arXiv preprint arXiv …, 2022 - arxiv.org
Multimodal abstractive summarization (MAS) aims to produce a concise summary given the
multimodal data (text and vision). Existing studies mainly focus on how to effectively use the …