A survey of evaluation metrics used for NLG systems

AB Sai, AK Mohankumar, MM Khapra - ACM Computing Surveys (CSUR …, 2022 - dl.acm.org
In the last few years, a large number of automatic evaluation metrics have been proposed for
evaluating Natural Language Generation (NLG) systems. The rapid development and …

Translation quality assessment: A brief survey on manual and automatic methods

L Han, GJF Jones, AF Smeaton - arXiv preprint arXiv:2105.03311, 2021 - arxiv.org
To facilitate effective translation modeling and translation studies, one of the crucial
questions to address is how to assess translation quality. From the perspectives of accuracy …

Spice: Semantic propositional image caption evaluation

P Anderson, B Fernando, M Johnson… - Computer Vision–ECCV …, 2016 - Springer
There is considerable interest in the task of automatically generating image captions.
However, evaluation is challenging. Existing automatic evaluation metrics are primarily …

Semantic structural evaluation for text simplification

E Sulem, O Abend, A Rappoport - arXiv preprint arXiv:1810.05022, 2018 - arxiv.org
Current measures for evaluating text simplification systems focus on evaluating lexical text
aspects, neglecting its structural aspects. In this paper we propose the first measure to …

[PDF][PDF] Using discourse structure improves machine translation evaluation

F Guzmán, S Joty, L Màrquez… - Proceedings of the 52nd …, 2014 - aclanthology.org
We present experiments in using discourse structure for improving machine translation
evaluation. We first design two discourse-aware similarity measures, which use all-subtree …

[PDF][PDF] Blend: a novel combined MT metric based on direct assessment—CASICT-DCU submission to WMT17 metrics task

Q Ma, Y Graham, S Wang, Q Liu - Proceedings of the second …, 2017 - aclanthology.org
Existing metrics to evaluate the quality of Machine Translation hypotheses take different
perspectives into account. DPM-Fcomb, a metric combining the merits of a range of metrics …

Machine translation evaluation with neural networks

F Guzmán, S Joty, L Màrquez, P Nakov - Computer Speech & Language, 2017 - Elsevier
We present a framework for machine translation evaluation using neural networks in a
pairwise setting, where the goal is to select the better translation from a pair of hypotheses …

Pairwise neural machine translation evaluation

F Guzmán, S Joty, L Màrquez, P Nakov - arXiv preprint arXiv:1912.03135, 2019 - arxiv.org
We present a novel framework for machine translation evaluation using neural networks in a
pairwise setting, where the goal is to select the better translation from a pair of hypotheses …

Machine translation evaluation resources and methods: A survey

L Han - arXiv preprint arXiv:1605.04515, 2016 - arxiv.org
We introduce the Machine Translation (MT) evaluation survey that contains both manual and
automatic evaluation methods. The traditional human evaluation criteria mainly include the …

Discourse structure in machine translation evaluation

S Joty, F Guzmán, L Màrquez, P Nakov - Computational Linguistics, 2017 - direct.mit.edu
In this article, we explore the potential of using sentence-level discourse structure for
machine translation evaluation. We first design discourse-aware similarity measures, which …