Neural machine translation: Challenges, progress and future

J Zhang, C Zong - Science China Technological Sciences, 2020 - Springer
Machine translation (MT) is a technique that leverages computers to translate
human languages automatically. Nowadays, neural machine translation (NMT) which …

Deep vision multimodal learning: Methodology, benchmark, and trend

W Chai, G Wang - Applied Sciences, 2022 - mdpi.com
Deep vision multimodal learning aims at combining deep visual representation learning with
other modalities, such as text, sound, and data collected from other sensors. With the fast …

Is neural machine translation the new state of the art?

S Castilho, J Moorkens, F Gaspari… - The Prague Bulletin …, 2017 - archive.sciendo.com
This paper discusses neural machine translation (NMT), a new paradigm in the MT field,
comparing the quality of NMT systems with statistical MT by describing three studies using …

Multimodal transformer for multimodal machine translation

S Yao, X Wan - Proceedings of the 58th annual meeting of the …, 2020 - aclanthology.org
Multimodal Machine Translation (MMT) aims to introduce information from other
modalities, generally static images, to improve the translation quality. Previous works propose …

A novel graph-based multi-modal fusion encoder for neural machine translation

Y Yin, F Meng, J Su, C Zhou, Z Yang, J Zhou… - arXiv preprint arXiv …, 2020 - arxiv.org
Multi-modal neural machine translation (NMT) aims to translate source sentences into a
target language paired with images. However, dominant multi-modal NMT models do not …

Trends in integration of vision and language research: A survey of tasks, datasets, and methods

A Mogadala, M Kalimuthu, D Klakow - Journal of Artificial Intelligence …, 2021 - jair.org
Interest in Artificial Intelligence (AI) and its applications has seen unprecedented
growth in the last few years. This success can be partly attributed to the advancements made …

UC2: Universal cross-lingual cross-modal vision-and-language pre-training

M Zhou, L Zhou, S Wang, Y Cheng… - Proceedings of the …, 2021 - openaccess.thecvf.com
Vision-and-language pre-training has achieved impressive success in learning multimodal
representations between vision and language. To generalize this success to non-English …

Neural machine translation with universal visual representation

Z Zhang, K Chen, R Wang, M Utiyama… - International …, 2020 - openreview.net
Though visual information has been introduced for enhancing neural machine translation
(NMT), its effectiveness strongly relies on the availability of large amounts of bilingual …

Incorporating global visual features into attention-based neural machine translation

I Calixto, Q Liu, N Campbell - arXiv preprint arXiv:1701.06521, 2017 - arxiv.org
We introduce multi-modal, attention-based neural machine translation (NMT) models which
incorporate visual features into different parts of both the encoder and the decoder. We …

Dynamic context-guided capsule network for multimodal machine translation

H Lin, F Meng, J Su, Y Yin, Z Yang, Y Ge… - Proceedings of the 28th …, 2020 - dl.acm.org
Multimodal machine translation (MMT), which mainly focuses on enhancing text-only
translation with visual features, has attracted considerable attention from both computer …