[HTML][HTML] Progress in machine translation

H Wang, H Wu, Z He, L Huang, KW Church - Engineering, 2022 - Elsevier
After more than 70 years of evolution, great achievements have been made in machine
translation. Especially in recent years, translation quality has been greatly improved with the …

Fairseq S2T: Fast speech-to-text modeling with fairseq

C Wang, Y Tang, X Ma, A Wu, S Popuri… - arXiv preprint arXiv …, 2020 - arxiv.org
We introduce fairseq S2T, a fairseq extension for speech-to-text (S2T) modeling tasks such
as end-to-end speech recognition and speech-to-text translation. It follows fairseq's careful …

Neural machine translation: Challenges, progress and future

J Zhang, C Zong - Science China Technological Sciences, 2020 - Springer
Abstract Machine translation (MT) is a technique that leverages computers to translate
human languages automatically. Nowadays, neural machine translation (NMT) which …

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation

L Barrault, YA Chung, MC Meglioli, D Dale… - arXiv preprint arXiv …, 2023 - arxiv.org
What does it take to create the Babel Fish, a tool that can help individuals translate speech
between any two languages? While recent breakthroughs in text-based models have …

Must-c: a multilingual speech translation corpus

MA Di Gangi, R Cattoni, L Bentivogli, M Negri… - Proceedings of the …, 2019 - cris.fbk.eu
Current research on spoken language translation (SLT) has to confront with the scarcity of
sizeable and publicly available training corpora. This problem hinders the adoption of neural …

Unified speech-text pre-training for speech translation and recognition

Y Tang, H Gong, N Dong, C Wang, WN Hsu… - arXiv preprint arXiv …, 2022 - arxiv.org
We describe a method to jointly pre-train speech and text in an encoder-decoder modeling
framework for speech translation and recognition. The proposed method incorporates four …

Must-c: A multilingual corpus for end-to-end speech translation

R Cattoni, MA Di Gangi, L Bentivogli, M Negri… - Computer speech & …, 2021 - Elsevier
End-to-end spoken language translation (SLT) has recently gained popularity thanks to the
advancement of sequence to sequence learning in its two parent tasks: automatic speech …

Multilingual speech translation with efficient finetuning of pretrained models

X Li, C Wang, Y Tang, C Tran, Y Tang, J Pino… - arXiv preprint arXiv …, 2020 - arxiv.org
We present a simple yet effective approach to build multilingual speech-to-text (ST)
translation by efficient transfer learning from pretrained speech encoder and text decoder …

Direct speech-to-speech translation with a sequence-to-sequence model

Y Jia, RJ Weiss, F Biadsy, W Macherey… - arXiv preprint arXiv …, 2019 - arxiv.org
We present an attention-based sequence-to-sequence neural network which can directly
translate speech from one language into speech in another language, without relying on an …

ESPnet-ST: All-in-one speech translation toolkit

H Inaguma, S Kiyono, K Duh, S Karita… - arXiv preprint arXiv …, 2020 - arxiv.org
We present ESPnet-ST, which is designed for the quick development of speech-to-speech
translation systems in a single framework. ESPnet-ST is a new project inside end-to-end …