A survey of evaluation metrics used for NLG systems
In the last few years, a large number of automatic evaluation metrics have been proposed for
evaluating Natural Language Generation (NLG) systems. The rapid development and …
Translation quality assessment: A brief survey on manual and automatic methods
To facilitate effective translation modeling and translation studies, one of the crucial
questions to address is how to assess translation quality. From the perspectives of accuracy …
Spice: Semantic propositional image caption evaluation
There is considerable interest in the task of automatically generating image captions.
However, evaluation is challenging. Existing automatic evaluation metrics are primarily …
Semantic structural evaluation for text simplification
Current measures for evaluating text simplification systems focus on evaluating lexical text
aspects, neglecting its structural aspects. In this paper we propose the first measure to …
Using discourse structure improves machine translation evaluation
We present experiments in using discourse structure for improving machine translation
evaluation. We first design two discourse-aware similarity measures, which use all-subtree …
Blend: a novel combined MT metric based on direct assessment—CASICT-DCU submission to WMT17 metrics task
Existing metrics to evaluate the quality of Machine Translation hypotheses take different
perspectives into account. DPM-Fcomb, a metric combining the merits of a range of metrics …
Machine translation evaluation with neural networks
We present a framework for machine translation evaluation using neural networks in a
pairwise setting, where the goal is to select the better translation from a pair of hypotheses …
Pairwise neural machine translation evaluation
We present a novel framework for machine translation evaluation using neural networks in a
pairwise setting, where the goal is to select the better translation from a pair of hypotheses …
Machine translation evaluation resources and methods: A survey
L Han - arXiv preprint arXiv:1605.04515, 2016 - arxiv.org
We introduce the Machine Translation (MT) evaluation survey that contains both manual and
automatic evaluation methods. The traditional human evaluation criteria mainly include the …
Discourse structure in machine translation evaluation
In this article, we explore the potential of using sentence-level discourse structure for
machine translation evaluation. We first design discourse-aware similarity measures, which …