Vatex: A large-scale, high-quality multilingual dataset for video-and-language research

X Wang, J Wu, J Chen, L Li… - Proceedings of the …, 2019 - openaccess.thecvf.com
We present a new large-scale multilingual video description dataset, VATEX, which contains
over 41,250 videos and 825,000 captions in both English and Chinese. Among the captions …

Findings of the second shared task on multimodal machine translation and multilingual image description

D Elliott, S Frank, L Barrault, F Bougares… - arXiv preprint arXiv …, 2017 - arxiv.org
We present the results from the second shared task on multimodal machine translation and
multilingual image description. Nine teams submitted 19 systems to two tasks. The …

Probing the need for visual context in multimodal machine translation

O Caglayan, P Madhyastha, L Specia… - arXiv preprint arXiv …, 2019 - arxiv.org
Current work on multimodal machine translation (MMT) has suggested that the visual
modality is either unnecessary or only marginally beneficial. We posit that this is a …

Multimodality information fusion for automated machine translation

L Li, T Tayir, Y Han, X Tao, JD Velásquez - Information Fusion, 2023 - Elsevier
Abstract Machine translation is a popular automation approach for translating texts between
different languages. Although traditionally it has a strong focus on natural language, images …

Distilling translations with visual awareness

J Ive, P Madhyastha, L Specia - arXiv preprint arXiv:1906.07701, 2019 - arxiv.org
Previous work on multimodal machine translation has shown that visual information is only
needed in very specific cases, for example in the presence of ambiguous words where the …

[HTML][HTML] Multimodal machine translation through visuals and speech

U Sulubacak, O Caglayan, SA Grönroos, A Rouhe… - Machine …, 2020 - Springer
Multimodal machine translation involves drawing information from more than one modality,
based on the assumption that the additional modalities will contain useful alternative views …

[PDF][PDF] 神经机器翻译前沿综述

冯洋, 邵晨泽 - 中文信息学报, 2020 - jcip.cipsc.org.cn
机器翻译是指通过计算机将源语言句子翻译到与之语义等价的目标语言句子的过程,
是自然语言处理领域的一个重要研究方向. 神经机器翻译仅需使用神经网络就能实现从源语言到 …

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

H Shen, L Shao, W Li, Z Lan, Z Liu, J Su - arXiv preprint arXiv:2405.12669, 2024 - arxiv.org
In recent years, multi-modal machine translation has attracted significant interest in both
academia and industry due to its superior performance. It takes both textual and visual …

Dose multimodal machine translation can improve translation performance?

SD Cui, K Duan, W Ma, H Shinnou - Neural Computing and Applications, 2024 - Springer
Multimodal machine translation (MMT) is a method that uses visual information to guide text
translation. However, recent studies have engendered controversy regarding the extent to …

Multimodal machine translation

O Caglayan - 2019 - theses.hal.science
Machine translation aims at automatically translating documents from one language to
another without human intervention. With the advent of deep neural networks (DNN), neural …