Pre-training on high-resource speech recognition improves low-resource speech-to-text translation

H Wang, H Wu, Z He, L Huang, KW Church - Engineering, 2022 - Elsevier

After more than 70 years of evolution, great achievements have been made in machine
translation. Especially in recent years, translation quality has been greatly improved with the …

被引用次数：221 相关文章所有 2 个版本

[PDF] mi-research.net

Transformer: A general framework from machine translation to others

Y Zhao, J Zhang, C Zong - Machine Intelligence Research, 2023 - Springer

Abstract Machine translation is an important and challenging task that aims at automatically
translating natural language sentences from one language into another. Recently …

被引用次数：29 相关文章所有 4 个版本

[PDF] arxiv.org

Fairseq S2T: Fast speech-to-text modeling with fairseq

C Wang, Y Tang, X Ma, A Wu, S Popuri… - arXiv preprint arXiv …, 2020 - arxiv.org

We introduce fairseq S2T, a fairseq extension for speech-to-text (S2T) modeling tasks such
as end-to-end speech recognition and speech-to-text translation. It follows fairseq's careful …

被引用次数：266 相关文章所有 5 个版本

[PDF] fbk.eu

Must-c: a multilingual speech translation corpus

MA Di Gangi, R Cattoni, L Bentivogli, M Negri… - Proceedings of the …, 2019 - cris.fbk.eu

Current research on spoken language translation (SLT) has to confront with the scarcity of
sizeable and publicly available training corpora. This problem hinders the adoption of neural …

被引用次数：403 相关文章所有 6 个版本

[PDF] arxiv.org

STEMM: Self-learning with speech-text manifold mixup for speech translation

Q Fang, R Ye, L Li, Y Feng, M Wang - arXiv preprint arXiv:2203.10426, 2022 - arxiv.org

How to learn a better speech representation for end-to-end speech-to-text translation (ST)
with limited labeled data? Existing techniques often attempt to transfer powerful machine …

被引用次数：94 相关文章所有 8 个版本

Must-c: A multilingual corpus for end-to-end speech translation

R Cattoni, MA Di Gangi, L Bentivogli, M Negri… - Computer speech & …, 2021 - Elsevier

End-to-end spoken language translation (SLT) has recently gained popularity thanks to the
advancement of sequence to sequence learning in its two parent tasks: automatic speech …

被引用次数：152 相关文章所有 2 个版本

[PDF] arxiv.org

Multilingual speech translation with efficient finetuning of pretrained models

X Li, C Wang, Y Tang, C Tran, Y Tang, J Pino… - arXiv preprint arXiv …, 2020 - arxiv.org

We present a simple yet effective approach to build multilingual speech-to-text (ST)
translation by efficient transfer learning from pretrained speech encoder and text decoder …

被引用次数：142 相关文章所有 6 个版本

[PDF] arxiv.org

The multilingual tedx corpus for speech recognition and translation

E Salesky, M Wiesner, J Bremerman, R Cattoni… - arXiv preprint arXiv …, 2021 - arxiv.org

We present the Multilingual TEDx corpus, built to support speech recognition (ASR) and
speech translation (ST) research across many non-English source languages. The corpus is …

被引用次数：133 相关文章所有 12 个版本

[PDF] isca-archive.org

[PDF][PDF] CoVoST 2 and Massively Multilingual Speech Translation.

C Wang, A Wu, J Gu, J Pino - Interspeech, 2021 - isca-archive.org

Speech translation (ST) is an increasingly popular topic of research, partly due to the
development of benchmark datasets. Nevertheless, current datasets cover a limited number …

被引用次数：132 相关文章所有 5 个版本

[PDF] arxiv.org

Cross-modal contrastive learning for speech translation

R Ye, M Wang, L Li - arXiv preprint arXiv:2205.02444, 2022 - arxiv.org

How can we learn unified representations for spoken utterances and their written text?
Learning similar representations for semantically similar speech and text is important for …

被引用次数：80 相关文章所有 9 个版本