Neural machine translation with phrase-level universal visual representations

Q Fang, Y Feng - arXiv preprint arXiv:2203.10299, 2022 - arxiv.org
Multimodal machine translation (MMT) aims to improve neural machine translation (NMT)
with additional visual information, but most existing MMT methods require paired input of …

Exploring better text image translation with multimodal codebook

Z Lan, J Yu, X Li, W Zhang, J Luan, B Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Text image translation (TIT) aims to translate the source texts embedded in the image to
target translations, which has a wide range of applications and thus has important research …

Layer-level progressive transformer with modality difference awareness for multi-modal neural machine translation

J Guo, J Ye, Y Xiang, Z Yu - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
Multi-modal neural machine translation (MNMT) aims to translate sentences from the source
language into the target language with the aid of corresponding images. Unfortunately, there …

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

H Shen, L Shao, W Li, Z Lan, Z Liu, J Su - arXiv preprint arXiv:2405.12669, 2024 - arxiv.org
In recent years, multi-modal machine translation has attracted significant interest in both
academia and industry due to its superior performance. It takes both textual and visual …

Research on the Application of Prompt Learning Pretrained Language Model in Machine Translation Task with Reinforcement Learning

C Wang, Z Li, T Chen, R Wang, Z Ju - Electronics, 2023 - mdpi.com
With the continuous advancement of deep learning technology, pretrained language models
have emerged as crucial tools for natural language processing tasks. However, optimization …

Bayesian Deep Multi-Agent Multimodal Reinforcement Learning for Embedded Systems in Games, Natural Language Processing and Robotics

I Kourouklides - 2022 - files.osf.io
Nowadays, Machine Learning is one of the most dynamic fields, as it attracts strong
research interest from both industry and academia alike. It is not surprising that a huge …

A Text-Image Pair Is Not Enough: Language-Vision Relation Inference with Auxiliary Modality Translation

W Lu, D Zhang, S Li, G Zhou - CCF International Conference on Natural …, 2023 - Springer
The semantic relations between language and vision modalities become more and more
vital since they can effectively facilitate downstream multi-modal tasks. Although several …

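Research on a Generative Dialogue System Based on Reinforcement Learning

颜永, 白宗文 - Hans Journal of Data Mining, 2023 - hanspub.org
Builds an open-domain dialogue system model with diverse responses, in an attempt to address
the problem of monotonous replies in dialogue systems. Proposes a generative dialogue method that combines a bidirectional long short-term memory network with a reinforcement learning model. First …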

Imaginations Generate Images for Multi-modal Machine Translation

X Yang, W Sun, W Wei, Y Li, X Shi - International Conference on Computer …, 2023 - Springer
Multi-modal machine translation (MMT) aims at exploring better translation systems by
integrating the visual annotation which presents the content described in the bilingual …

Adding visual attention into encoder-decoder model for multi-modal machine translation

C Xu, Z Yu, X Shi, F Chen - Journal of Engineering Research, 2023 - Elsevier
Multi-modal neural machine translation (MNMT) aims to integrate visual and textual
information to translate source sentences into target and attracts a lot of attention. Existing …