[HTML][HTML] Data augmentation approaches in natural language processing: A survey

B Li, Y Hou, W Che - Ai Open, 2022 - Elsevier
As an effective strategy, data augmentation (DA) alleviates data scarcity scenarios where
deep learning techniques may fail. It is widely applied in computer vision then introduced to …

A survey of data augmentation approaches for NLP

SY Feng, V Gangal, J Wei, S Chandar… - arXiv preprint arXiv …, 2021 - arxiv.org
Data augmentation has recently seen increased interest in NLP due to more work in low-
resource domains, new tasks, and the popularity of large-scale neural networks that require …

Domain adaptation and multi-domain adaptation for neural machine translation: A survey

D Saunders - Journal of Artificial Intelligence Research, 2022 - jair.org
The development of deep learning techniques has allowed Neural Machine Translation
(NMT) models to become extremely powerful, given sufficient training data and training time …

Overview of the 8th workshop on Asian translation

T Nakazawa, H Nakayama, C Ding… - Proceedings of the …, 2021 - aclanthology.org
This paper presents the results of the shared tasks from the 8th workshop on Asian
translation (WAT2021). For the WAT2021, 28 teams participated in the shared tasks and 24 …

Dictionary-based phrase-level prompting of large language models for machine translation

M Ghazvininejad, H Gonen, L Zettlemoyer - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) demonstrate remarkable machine translation (MT) abilities
via prompting, even though they were not explicitly trained for this task. However, even given …

Alignment-augmented consistent translation for multilingual open information extraction

K Kolluru, M Mohammed, S Mittal… - Proceedings of the …, 2022 - aclanthology.org
Abstract Progress with supervised Open Information Extraction (OpenIE) has been primarily
limited to English due to the scarcity of training data in other languages. In this paper, we …

Accurate word alignment induction from neural machine translation

Y Chen, Y Liu, G Chen, X Jiang, Q Liu - arXiv preprint arXiv:2004.14837, 2020 - arxiv.org
Despite its original goal to jointly learn to align and translate, prior researches suggest that
Transformer captures poor word alignments through its attention mechanism. In this paper …

Prompt-driven neural machine translation

Y Li, Y Yin, J Li, Y Zhang - Findings of the Association for …, 2022 - aclanthology.org
Neural machine translation (NMT) has obtained significant performance improvement over
the recent years. However, NMT models still face various challenges including fragility and …

Constrained abstractive summarization: Preserving factual consistency with constrained generation

Y Mao, X Ren, H Ji, J Han - arXiv preprint arXiv:2010.12723, 2020 - arxiv.org
Despite significant progress, state-of-the-art abstractive summarization methods are still
prone to hallucinate content inconsistent with the source document. In this paper, we …

Gender neutralization for an inclusive machine translation: from theoretical foundations to open challenges

A Piergentili, D Fucci, B Savoldi, L Bentivogli… - arXiv preprint arXiv …, 2023 - arxiv.org
Gender inclusivity in language technologies has become a prominent research topic. In this
study, we explore gender-neutral translation (GNT) as a form of gender inclusivity and a goal …