Multi-agent learning for neural machine translation

Y Zhao, RH Zhang, S Zhou, Z Zhang - Findings of the Association …, 2020 - aclanthology.org

Active learning is an efficient approach for mitigating data dependency when training neural
machine translation (NMT) models. In this paper, we explore new training frameworks by …

被引用次数：40 相关文章

[PDF] aclanthology.org

Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation

Z Miao, W Zhang, J Su, X Li, J Luan… - Proceedings of the …, 2023 - aclanthology.org

Conventional knowledge distillation (KD) approaches are commonly employed to compress
neural machine translation (NMT) models. However, they only obtain one lightweight …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Exploring the intersection of large language models and agent-based modeling via prompt engineering

E Junprung - arXiv preprint arXiv:2308.07411, 2023 - arxiv.org

The final frontier for simulation is the accurate representation of complex, real-world social
systems. While agent-based modeling (ABM) seeks to study the behavior and interactions of …

被引用次数：16 相关文章所有 2 个版本

[PDF] arxiv.org

Guiding teacher forcing with seer forcing for neural machine translation

Y Feng, S Gu, D Guo, Z Yang, C Shao - arXiv preprint arXiv:2106.06751, 2021 - arxiv.org

Although teacher forcing has become the main training paradigm for neural machine
translation, it usually makes predictions only conditioned on past information, and hence …

被引用次数：15 相关文章所有 4 个版本

[PDF] arxiv.org

Improved training of mixture-of-experts language gans

Y Chai, Q Yin, J Zhang - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org

Despite the dramatic success in image generation, Generative Adversarial Networks (GANs)
still face great challenges in text generation. The difficulty in generator training arises from …

被引用次数：4 相关文章所有 3 个版本

Dualformer: a unified bidirectional sequence-to-sequence learning

JT Chien, WH Chang - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org

This paper presents a new dual domain mapping based on a unified bidirectional sequence-
to-sequence (seq2seq) learning. Traditionally, dual learning in domain mapping was …

被引用次数：9 相关文章所有 2 个版本

Exploring iterative dual domain adaptation for neural machine translation

X Liu, J Zeng, X Wang, Z Wang, J Su - Knowledge-Based Systems, 2024 - Elsevier

Abstract Domain adaptation for neural machine translation (NMT) has always been a hot
research topic in the community of machine translation. Generally, previous studies focus on …

被引用次数：2 相关文章所有 3 个版本

[PDF] aclanthology.org

Collaborative learning of bidirectional decoders for unsupervised text style transfer

Y Ma, Y Chen, X Mao, Q Li - … of the 2021 Conference on Empirical …, 2021 - aclanthology.org

Unsupervised text style transfer aims to alter the underlying style of the text to a desired
value while keeping its style-independent semantics, without the support of parallel training …

被引用次数：7 相关文章所有 4 个版本

[PDF] aclanthology.org

Multi-agent mutual learning at sentence-level and token-level for neural machine translation

B Liao, Y Gao, H Ney - Findings of the Association for …, 2020 - aclanthology.org

Mutual learning, where multiple agents learn collaboratively and teach one another, has
been shown to be an effective way to distill knowledge for image classification tasks. In this …

被引用次数：8 相关文章所有 4 个版本

[PDF] arxiv.org

Schema-Guided Culture-Aware Complex Event Simulation with Multi-Agent Role-Play

S Li, RG Reddy, KD Nguyen, Q Wang, M Fung… - arXiv preprint arXiv …, 2024 - arxiv.org

Complex news events, such as natural disasters and socio-political conflicts, require swift
responses from the government and society. Relying on historical events to project the future …