Neural machine translation for low-resource languages: A survey

S Ranathunga, ESA Lee, M Prifti Skenduli… - ACM Computing …, 2023 - dl.acm.org
Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since
the early 2000s and has already entered a mature phase. While considered the most widely …

Redistributing low-frequency words: Making the most of monolingual data in non-autoregressive translation

L Ding, L Wang, S Shi, D Tao, Z Tu - … of the 60th Annual Meeting of …, 2022 - aclanthology.org
Abstract Knowledge distillation (KD) is the preliminary step for training non-autoregressive
translation (NAT) models, which eases the training of NAT models at the cost of losing …

Tagged back-translation

I Caswell, C Chelba, D Grangier - arXiv preprint arXiv:1906.06442, 2019 - arxiv.org
Recent work in Neural Machine Translation (NMT) has shown significant quality gains from
noised-beam decoding during back-translation, a method to generate synthetic parallel …

A voyage on neural machine translation for Indic languages

SK Sheshadri, D Gupta, MR Costa-Jussà - Procedia Computer Science, 2023 - Elsevier
With the invention of deep learning concepts, Machine Translation (MT) migrated towards
Neural Machine Translation (NMT) architectures, eventually from Statistical Machine …

Improved lexically constrained decoding for translation and monolingual rewriting

JE Hu, H Khayrallah, R Culkin, P Xia… - Proceedings of the …, 2019 - aclanthology.org
Lexically-constrained sequence decoding allows for explicit positive or negative phrase-
based constraints to be placed on target output strings in generation tasks such as machine …

Rejuvenating low-frequency words: Making the most of parallel data in non-autoregressive translation

L Ding, L Wang, X Liu, DF Wong, D Tao… - arXiv preprint arXiv …, 2021 - arxiv.org
Knowledge distillation (KD) is commonly used to construct synthetic data for training non-
autoregressive translation (NAT) models. However, there exists a discrepancy on low …

Improving back-translation with uncertainty-based confidence estimation

S Wang, Y Liu, C Wang, H Luan, M Sun - arXiv preprint arXiv:1909.00157, 2019 - arxiv.org
While back-translation is simple and effective in exploiting abundant monolingual corpora to
improve low-resource neural machine translation (NMT), the synthetic bilingual corpora …

Fast and accurate neural machine translation with translation memory

Q He, G Huang, Q Cui, L Li, L Liu - … of the 59th Annual Meeting of …, 2021 - aclanthology.org
It is generally believed that a translation memory (TM) should be beneficial for machine
translation tasks. Unfortunately, existing wisdom demonstrates the superiority of TM-based …

[PDF][PDF] Leveraging orthographic information to improve machine translation of under-resourced languages

A Chakravarthi, B Raja - 2020 - aran.library.nuigalway.ie
This thesis presents Bharathi Raja Asoka Chakravarthi's work on machine translation for
underresourced languages. As the internet becomes increasingly available to the whole …

Enhancing social network hate detection using back translation and GPT-3 augmentations during training and test-time

S Cohen, D Presil, O Katz, O Arbili, S Messica… - information …, 2023 - Elsevier
Social media platforms have become an essential means of communication, but they also
serve as a breeding ground for hateful content. Detecting hate speech accurately is …