Vatex: A large-scale, high-quality multilingual dataset for video-and-language research
We present a new large-scale multilingual video description dataset, VATEX, which contains
over 41,250 videos and 825,000 captions in both English and Chinese. Among the captions …
over 41,250 videos and 825,000 captions in both English and Chinese. Among the captions …
Findings of the second shared task on multimodal machine translation and multilingual image description
We present the results from the second shared task on multimodal machine translation and
multilingual image description. Nine teams submitted 19 systems to two tasks. The …
multilingual image description. Nine teams submitted 19 systems to two tasks. The …
Probing the need for visual context in multimodal machine translation
Current work on multimodal machine translation (MMT) has suggested that the visual
modality is either unnecessary or only marginally beneficial. We posit that this is a …
modality is either unnecessary or only marginally beneficial. We posit that this is a …
Multimodality information fusion for automated machine translation
Abstract Machine translation is a popular automation approach for translating texts between
different languages. Although traditionally it has a strong focus on natural language, images …
different languages. Although traditionally it has a strong focus on natural language, images …
Distilling translations with visual awareness
Previous work on multimodal machine translation has shown that visual information is only
needed in very specific cases, for example in the presence of ambiguous words where the …
needed in very specific cases, for example in the presence of ambiguous words where the …
[HTML][HTML] Multimodal machine translation through visuals and speech
Multimodal machine translation involves drawing information from more than one modality,
based on the assumption that the additional modalities will contain useful alternative views …
based on the assumption that the additional modalities will contain useful alternative views …
[PDF][PDF] 神经机器翻译前沿综述
冯洋, 邵晨泽 - 中文信息学报, 2020 - jcip.cipsc.org.cn
机器翻译是指通过计算机将源语言句子翻译到与之语义等价的目标语言句子的过程,
是自然语言处理领域的一个重要研究方向. 神经机器翻译仅需使用神经网络就能实现从源语言到 …
是自然语言处理领域的一个重要研究方向. 神经机器翻译仅需使用神经网络就能实现从源语言到 …
A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges
In recent years, multi-modal machine translation has attracted significant interest in both
academia and industry due to its superior performance. It takes both textual and visual …
academia and industry due to its superior performance. It takes both textual and visual …
Dose multimodal machine translation can improve translation performance?
SD Cui, K Duan, W Ma, H Shinnou - Neural Computing and Applications, 2024 - Springer
Multimodal machine translation (MMT) is a method that uses visual information to guide text
translation. However, recent studies have engendered controversy regarding the extent to …
translation. However, recent studies have engendered controversy regarding the extent to …
Multimodal machine translation
O Caglayan - 2019 - theses.hal.science
Machine translation aims at automatically translating documents from one language to
another without human intervention. With the advent of deep neural networks (DNN), neural …
another without human intervention. With the advent of deep neural networks (DNN), neural …