Multi-document summarization via deep learning techniques: A survey

C Ma, WE Zhang, M Guo, H Wang, QZ Sheng - ACM Computing Surveys, 2022 - dl.acm.org
Multi-document summarization (MDS) is an effective tool for information aggregation that
generates an informative and concise summary from a cluster of topic-related documents …

Summeval: Re-evaluating summarization evaluation

AR Fabbri, W Kryściński, B McCann, C Xiong… - Transactions of the …, 2021 - direct.mit.edu
The scarcity of comprehensive up-to-date studies on evaluation metrics for text
summarization and the lack of consensus regarding evaluation protocols continue to inhibit …

Neural text summarization: A critical evaluation

W Kryściński, NS Keskar, B McCann, C Xiong… - arXiv preprint arXiv …, 2019 - arxiv.org
Text summarization aims at compressing long documents into a shorter form that conveys
the most important parts of the original document. Despite increased interest in the …

SUPERT: Towards new frontiers in unsupervised evaluation metrics for multi-document summarization

Y Gao, W Zhao, S Eger - arXiv preprint arXiv:2005.03724, 2020 - arxiv.org
We study unsupervised multi-document summarization evaluation metrics, which require
neither human-written reference summaries nor human annotations (eg preferences …

A comprehensive review of automatic text summarization techniques: method, data, evaluation and coding

DO Cajueiro, AG Nery, I Tavares, MK De Melo… - arXiv preprint arXiv …, 2023 - arxiv.org
We provide a literature review about Automatic Text Summarization (ATS) systems. We
consider a citation-based approach. We start with some popular and well-known papers that …

MRC-Sum: An MRC framework for extractive summarization of academic articles in natural sciences and medicine

S Li, J Xu - Information Processing & Management, 2023 - Elsevier
Extractive summarization for academic articles in natural sciences and medicine has
attracted attention for a long time. However, most existing extractive summarization models …

ROUGE-SEM: Better evaluation of summarization using ROUGE combined with semantics

M Zhang, C Li, M Wan, X Zhang, Q Zhao - Expert Systems with Applications, 2024 - Elsevier
With the development of pre-trained language models and large-scale datasets, automatic
text summarization has attracted much attention from the community of natural language …

GRUEN for evaluating linguistic quality of generated text

W Zhu, S Bhat - arXiv preprint arXiv:2010.02498, 2020 - arxiv.org
Automatic evaluation metrics are indispensable for evaluating generated text. To date, these
metrics have focused almost exclusively on the content selection aspect of the system …

How Far are We from Robust Long Abstractive Summarization?

HY Koh, J Ju, H Zhang, M Liu, S Pan - arXiv preprint arXiv:2210.16732, 2022 - arxiv.org
Abstractive summarization has made tremendous progress in recent years. In this work, we
perform fine-grained human annotations to evaluate long document abstractive …

A global analysis of metrics used for measuring performance in natural language processing

K Blagec, G Dorffner, M Moradi, S Ott… - arXiv preprint arXiv …, 2022 - arxiv.org
Measuring the performance of natural language processing models is challenging.
Traditionally used metrics, such as BLEU and ROUGE, originally devised for machine …