A comprehensive survey on pretrained foundation models: A history from bert to chatgpt

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks with different data modalities. A PFM (eg, BERT, ChatGPT, and GPT-4) is …

An empirical survey on long document summarization: Datasets, models, and metrics

HY Koh, J Ju, M Liu, S Pan - ACM computing surveys, 2022 - dl.acm.org
Long documents such as academic articles and business reports have been the standard
format to detail out important issues and complicated subjects that require extra attention. An …

SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

P Laban, T Schnabel, PN Bennett… - Transactions of the …, 2022 - direct.mit.edu
In the summarization domain, a key requirement for summaries is to be factually consistent
with the input document. Previous work has found that natural language inference (NLI) …

Chatgpt as a factual inconsistency evaluator for text summarization

Z Luo, Q Xie, S Ananiadou - arXiv preprint arXiv:2303.15621, 2023 - arxiv.org
The performance of text summarization has been greatly boosted by pre-trained language
models. A main concern of existing methods is that most generated summaries are not …

QAFactEval: Improved QA-based factual consistency evaluation for summarization

AR Fabbri, CS Wu, W Liu, C Xiong - arXiv preprint arXiv:2112.08542, 2021 - arxiv.org
Factual consistency is an essential quality of text summarization models in practical settings.
Existing work in evaluating this dimension can be broadly categorized into two lines of …

[HTML][HTML] Pre-trained language models with domain knowledge for biomedical extractive summarization

Q Xie, JA Bishop, P Tiwari, S Ananiadou - Knowledge-Based Systems, 2022 - Elsevier
Biomedical text summarization is a critical task for comprehension of an ever-growing
amount of biomedical literature. Pre-trained language models (PLMs) with transformer …

AlignScore: Evaluating factual consistency with a unified alignment function

Y Zha, Y Yang, R Li, Z Hu - arXiv preprint arXiv:2305.16739, 2023 - arxiv.org
Many text generation applications require the generated text to be factually consistent with
input information. Automatic evaluation of factual consistency is challenging. Previous work …

Understanding factual errors in summarization: Errors, summarizers, datasets, error detectors

L Tang, T Goyal, AR Fabbri, P Laban, J Xu… - arXiv preprint arXiv …, 2022 - arxiv.org
The propensity of abstractive summarization models to make factual errors has been studied
extensively, including design of metrics to detect factual errors and annotation of errors in …

Submodularity in machine learning and artificial intelligence

J Bilmes - arXiv preprint arXiv:2202.00132, 2022 - arxiv.org
In this manuscript, we offer a gentle review of submodularity and supermodularity and their
properties. We offer a plethora of submodular definitions; a full description of a number of …

SUMMEDITS: measuring LLM ability at factual reasoning through the lens of summarization

P Laban, W Kryściński, D Agarwal… - Proceedings of the …, 2023 - aclanthology.org
With the recent appearance of LLMs in practical settings, having methods that can effectively
detect factual inconsistencies is crucial to reduce the propagation of misinformation and …