Hindi visual genome: A dataset for multi-modal english to hindi machine translation

T Nakazawa, H Nakayama, C Ding… - Proceedings of the …, 2021 - aclanthology.org

This paper presents the results of the shared tasks from the 8th workshop on Asian
translation (WAT2021). For the WAT2021, 28 teams participated in the shared tasks and 24 …

被引用次数：168 相关文章所有 15 个版本

[PDF] arxiv.org

Universal multimodal representation for language understanding

Z Zhang, K Chen, R Wang, M Utiyama… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Representation learning is the foundation of natural language processing (NLP). This work
presents new methods to employ visual information as assistant signals to general NLP …

被引用次数：13 相关文章所有 8 个版本

[PDF] academia.edu

An encoder-decoder based framework for hindi image caption generation

A Singh, TD Singh, S Bandyopadhyay - Multimedia tools and applications, 2021 - Springer

In recent times, research activity on image caption generation has attracted several
researchers. The present work attempt to address the problem of Hindi image caption …

被引用次数：24 相关文章所有 5 个版本

[PDF] aclanthology.org

Multimodality for NLP-centered applications: Resources, advances and frontiers

M Garg, S Wazarkar, M Singh… - Proceedings of the …, 2022 - aclanthology.org

With the development of multimodal systems and natural language generation techniques,
the resurgence of multimodal datasets has attracted significant research interests, which …

被引用次数：12 相关文章所有 5 个版本

[PDF] sciencedirect.com

Hindi to English Multimodal Machine Translation on News Dataset in Low Resource Setting

LS Meetei, SM Singh, A Singh, R Das, TD Singh… - Procedia Computer …, 2023 - Elsevier

This work proposes a multimodal Hindi-to-English machine translation on a news corpus by
integrating multiple input modalities. The experimental dataset comprises of an image from a …

被引用次数：10 相关文章所有 2 个版本

Exploiting multiple correlated modalities can enhance low-resource machine translation quality

LS Meetei, TD Singh, S Bandyopadhyay - Multimedia Tools and …, 2024 - Springer

In an effort to enhance the machine translation (MT) quality of low-resource languages, we
report the first study on multimodal machine translation (MMT) for Manipuri→ English …

被引用次数：5 相关文章

[PDF] arxiv.org

Hausa visual genome: A dataset for multi-modal English to Hausa machine translation

I Abdulmumin, SR Dash, MA Dawud, S Parida… - arXiv preprint arXiv …, 2022 - arxiv.org

Multi-modal Machine Translation (MMT) enables the use of visual information to enhance
the quality of translations. The visual information can serve as a valuable piece of context …

被引用次数：10 相关文章所有 10 个版本

[PDF] aclanthology.org

A visually-grounded parallel corpus with phrase-to-region linking

H Nakayama, A Tamura… - Proceedings of the Twelfth …, 2020 - aclanthology.org

Visually-grounded natural language processing has become an important research
direction in the past few years. However, majorities of the available cross-modal resources …

被引用次数：23 相关文章所有 3 个版本

[PDF] arxiv.org

Impact of visual context on noisy multimodal NMT: an empirical study for English to Indian languages

B Gain, D Bandyopadhyay, S Mukherjee… - arXiv preprint arXiv …, 2023 - arxiv.org

The study investigates the effectiveness of utilizing multimodal information in Neural
Machine Translation (NMT). While prior research focused on using multimodal data in low …

被引用次数：2 相关文章所有 2 个版本

Exploring practical deep learning approaches for English-to-Hindi image caption translation using transformers and object detectors

P Bisht, A Solanki - Applications of Artificial Intelligence and Machine …, 2022 - Springer

Most of the captions available for images are only present in a few languages prominent on
the internet. The task of machine translation of image captions aims to democratize this …

被引用次数：6 相关文章所有 3 个版本