[PDF][PDF] Is neural machine translation the new state of the art?

S Castilho, J Moorkens, F Gaspari… - The Prague Bulletin …, 2017 - archive.sciendo.com
This paper discusses neural machine translation (NMT), a new paradigm in the MT field,
comparing the quality of NMT systems with statistical MT by describing three studies using …

Doubly-attentive decoder for multi-modal neural machine translation

I Calixto, Q Liu, N Campbell - arXiv preprint arXiv:1702.01287, 2017 - arxiv.org
We introduce a Multi-modal Neural Machine Translation model in which a doubly-attentive
decoder naturally incorporates spatial visual features obtained using pre-trained …

Neural natural language generation: A survey on multilinguality, multimodality, controllability and learning

E Erdem, M Kuyu, S Yagcioglu, A Frank… - Journal of Artificial …, 2022 - jair.org
Developing artificial learning systems that can understand and generate natural language
has been one of the long-standing goals of artificial intelligence. Recent decades have …

Exploring better text image translation with multimodal codebook

Z Lan, J Yu, X Li, W Zhang, J Luan, B Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Text image translation (TIT) aims to translate the source texts embedded in the image to
target translations, which has a wide range of applications and thus has important research …

Product-oriented machine translation with cross-modal cross-lingual pre-training

Y Song, S Chen, Q Jin, W Luo, J Xie… - Proceedings of the 29th …, 2021 - dl.acm.org
Translating e-commercial product descriptions, aka product-oriented machine translation
(PMT), is essential to serve e-shoppers all over the world. However, due to the domain …

PEIT: Bridging the Modality Gap with Pre-trained Models for End-to-End Image Translation

S Zhu, S Li, Y Lei, D Xiong - … of the 61st Annual Meeting of the …, 2023 - aclanthology.org
Image translation is a task that translates an image containing text in the source language to
the target language. One major challenge with image translation is the modality gap …

Enhancing neural machine translation with dual-side multimodal awareness

Y Song, S Chen, Q Jin, W Luo, J Xie… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Multimodal machine translation (MMT) aims to translate a sentence in the source language
into the target language with the context of an associated image. According to where the …

From words to sentences: A progressive learning approach for zero-resource machine translation with visual pivots

S Chen, Q Jin, J Fu - arXiv preprint arXiv:1906.00872, 2019 - arxiv.org
The neural machine translation model has suffered from the lack of large-scale parallel
corpora. In contrast, we humans can learn multi-lingual translations even without parallel …

The steep road to happily ever after: An analysis of current visual storytelling models

Y Modi, N Parde - arXiv preprint arXiv:1904.03366, 2019 - arxiv.org
Visual storytelling is an intriguing and complex task that only recently entered the research
arena. In this work, we survey relevant work to date, and conduct a thorough error analysis …

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

H Shen, L Shao, W Li, Z Lan, Z Liu, J Su - arXiv preprint arXiv:2405.12669, 2024 - arxiv.org
In recent years, multi-modal machine translation has attracted significant interest in both
academia and industry due to its superior performance. It takes both textual and visual …