Fluent and low-latency simultaneous speech-to-speech translation with self-adaptive training

R Zheng, J Chen, M Ma… - … Conference on Machine …, 2021 - proceedings.mlr.press

Recently, representation learning for text and speech has successfully improved many
language-related tasks. However, all existing methods suffer from two limitations: (a) they …

Cited by 63 - Related articles - All 4 versions

PaddleSpeech: An easy-to-use all-in-one speech toolkit

H Zhang, T Yuan, J Chen, X Li, R Zheng… - arXiv preprint arXiv …, 2022 - arxiv.org

PaddleSpeech is an open-source all-in-one speech toolkit. It aims at facilitating the
development and research of speech processing technologies by providing an easy-to-use …

Cited by 20 - Related articles - All 5 versions

Direct simultaneous speech-to-text translation assisted by synchronized streaming ASR

J Chen, M Ma, R Zheng, L Huang - arXiv preprint arXiv:2106.06636, 2021 - arxiv.org

Simultaneous speech-to-text translation is widely useful in many scenarios. The
conventional cascaded approach uses a pipeline of streaming ASR followed by …

Cited by 26 - Related articles - All 4 versions

Incremental text-to-speech synthesis with prefix-to-prefix framework

M Ma, B Zheng, K Liu, R Zheng, H Liu, K Peng… - arXiv preprint arXiv …, 2019 - arxiv.org

Text-to-speech synthesis (TTS) has witnessed rapid progress in recent years, where neural
methods became capable of producing audios with high naturalness. However, these efforts …

Cited by 34 - Related articles - All 4 versions

ELITR multilingual live subtitling: Demo and strategy

O Bojar, D Macháček, S Sagar, O Smrž, J Kratochvíl… - 2021 - zora.uzh.ch

This paper presents an automatic speech translation system aimed at live subtitling of
conference presentations. We describe the overall architecture and key processing …

Cited by 16 - Related articles - All 10 versions

Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach

J Chen, J Xue, P Wang, J Pan… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

Simultaneous Speech-to-Text translation serves a critical role in real-time crosslingual
communication. Despite the advancements in recent years, challenges remain in achieving …

Cited by 1 - Related articles - All 3 versions

Low-latency incremental text-to-speech synthesis with distilled context prediction network

T Saeki, S Takamichi… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org

Incremental text-to-speech (TTS) synthesis generates utterances in small linguistic units for
the sake of real-time and low-latency applications. We previously proposed an incremental …

Cited by 3 - Related articles - All 5 versions

Barriers to Effective Evaluation of Simultaneous Interpretation

S Wein, I Te, C Cherry, J Juraska… - Findings of the …, 2024 - aclanthology.org

Simultaneous interpretation is an especially challenging form of translation because it
requires converting speech from one language to another in real-time. Though prior work …

Direct simultaneous speech-to-speech translation with variational monotonic multihead attention

X Ma, H Gong, D Liu, A Lee, Y Tang, PJ Chen… - arXiv preprint arXiv …, 2021 - arxiv.org

We present a direct simultaneous speech-to-speech translation (Simul-S2ST) model.
Furthermore, the generation of translation is independent from intermediate text …

Cited by 2 - Related articles - All 2 versions

End-to-End Simultaneous Speech Translation

X Ma - 2022 - jscholarship.library.jhu.edu

Speech translation is the task of translating speech in one language to text or speech in
another language, while simultaneous translation aims at lower translation latency by …