Fully automatic semantic MT evaluation

AB Sai, AK Mohankumar, MM Khapra - ACM Computing Surveys (CSUR …, 2022 - dl.acm.org

In the last few years, a large number of automatic evaluation metrics have been proposed for
evaluating Natural Language Generation (NLG) systems. The rapid development and …

Cited by 234 · Related articles · All 4 versions

Translation quality assessment: A brief survey on manual and automatic methods

L Han, GJF Jones, AF Smeaton - arXiv preprint arXiv:2105.03311, 2021 - arxiv.org

To facilitate effective translation modeling and translation studies, one of the crucial
questions to address is how to assess translation quality. From the perspectives of accuracy …

Cited by 47 · Related articles · All 12 versions

SPICE: Semantic propositional image caption evaluation

P Anderson, B Fernando, M Johnson… - Computer Vision–ECCV …, 2016 - Springer

There is considerable interest in the task of automatically generating image captions.
However, evaluation is challenging. Existing automatic evaluation metrics are primarily …

Cited by 2108 · Related articles · All 13 versions

Semantic structural evaluation for text simplification

E Sulem, O Abend, A Rappoport - arXiv preprint arXiv:1810.05022, 2018 - arxiv.org

Current measures for evaluating text simplification systems focus on evaluating lexical text
aspects, neglecting its structural aspects. In this paper we propose the first measure to …

Cited by 109 · Related articles · All 8 versions

Using discourse structure improves machine translation evaluation

F Guzmán, S Joty, L Màrquez… - Proceedings of the 52nd …, 2014 - aclanthology.org

We present experiments in using discourse structure for improving machine translation
evaluation. We first design two discourse-aware similarity measures, which use all-subtree …

Cited by 121 · Related articles · All 3 versions

Blend: a novel combined MT metric based on direct assessment—CASICT-DCU submission to WMT17 metrics task

Q Ma, Y Graham, S Wang, Q Liu - Proceedings of the second …, 2017 - aclanthology.org

Existing metrics to evaluate the quality of Machine Translation hypotheses take different
perspectives into account. DPM-Fcomb, a metric combining the merits of a range of metrics …

Cited by 65 · Related articles · All 5 versions

Machine translation evaluation with neural networks

F Guzmán, S Joty, L Màrquez, P Nakov - Computer Speech & Language, 2017 - Elsevier

We present a framework for machine translation evaluation using neural networks in a
pairwise setting, where the goal is to select the better translation from a pair of hypotheses …

Cited by 48 · Related articles · All 6 versions

Pairwise neural machine translation evaluation

F Guzmán, S Joty, L Màrquez, P Nakov - arXiv preprint arXiv:1912.03135, 2019 - arxiv.org

We present a novel framework for machine translation evaluation using neural networks in a
pairwise setting, where the goal is to select the better translation from a pair of hypotheses …

Cited by 54 · Related articles · All 3 versions

Machine translation evaluation resources and methods: A survey

L Han - arXiv preprint arXiv:1605.04515, 2016 - arxiv.org

We introduce the Machine Translation (MT) evaluation survey that contains both manual and
automatic evaluation methods. The traditional human evaluation criteria mainly include the …

Cited by 38 · Related articles · All 6 versions

Discourse structure in machine translation evaluation

S Joty, F Guzmán, L Màrquez, P Nakov - Computational Linguistics, 2017 - direct.mit.edu

In this article, we explore the potential of using sentence-level discourse structure for
machine translation evaluation. We first design discourse-aware similarity measures, which …

Cited by 32 · Related articles · All 11 versions