Language-routing mixture of experts for multilingual and code-switching speech recognition

W Wang, G Ma, Y Li, B Du - arXiv preprint arXiv:2307.05956, 2023 - arxiv.org
Multilingual speech recognition for both monolingual and code-switching speech is a
challenging task. Recently, based on the Mixture of Experts (MoE), many works have made …

Learning asr pathways: A sparse multilingual asr model

M Yang, A Tjandra, C Liu, D Zhang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Neural network pruning compresses automatic speech recognition (ASR) models effectively.
However, in multilingual ASR, language-agnostic pruning may lead to severe performance …

Towards zero-shot code-switched speech recognition

B Yan, M Wiesner, O Klejch, P Jyothi… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this work, we seek to build effective code-switched (CS) automatic speech recognition
systems (ASR) under the zero-shot set-ting where no transcribed CS speech data is …

Language-specific acoustic boundary learning for mandarin-english code-switching speech recognition

Z Fan, L Dong, C Shen, Z Liang, J Zhang, L Lu… - arXiv preprint arXiv …, 2023 - arxiv.org
Code-switching speech recognition (CSSR) transcribes speech that switches between
multiple languages or dialects within a single sentence. The main challenge in this task is …

Semi-supervised learning for code-switching asr with large language model filter

Y Xi, W Ding, K Yu, J Lai - arXiv preprint arXiv:2407.04219, 2024 - arxiv.org
Code-switching (CS) phenomenon occurs when words or phrases from different languages
are alternated in a single sentence. Due to data scarcity, building an effective CS Automatic …

Improving multilingual and code-switching asr using large language model generated text

K Hu, TN Sainath, B Li, Y Zhang… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
We investigate using large language models (LLMs) to generate text-only training data for
improving multilingual and code-switching automatic speech recognition (ASR) through a …

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Z Liang, Z Song, Z Ma, C Du, K Yu, X Chen - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, end-to-end (E2E) automatic speech recognition (ASR) models have made great
strides and exhibit excellent performance in general speech recognition. However, there …

Sc-moe: Switch conformer mixture of experts for unified streaming and non-streaming code-switching asr

S Ye, S Chen, X Hu, X Xu - arXiv preprint arXiv:2406.18021, 2024 - arxiv.org
In this work, we propose a Switch-Conformer-based MoE system named SC-MoE for unified
streaming and non-streaming code-switching (CS) automatic speech recognition (ASR) …

BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition

P Chen, F Yu, Y Liang, H Xue, X Wan… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Mixture-of-experts based models, which use language experts to extract language-specific
representations effectively, have been well applied in code-switching automatic speech …

LAE-ST-MOE: Boosted Language-Aware Encoder Using Speech Translation Auxiliary Task for E2E Code-Switching ASR

G Ma, W Wang, Y Li, Y Yang, B Du… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Recently, to mitigate the confusion between different languages in code-switching (CS)
automatic speech recognition (ASR), the conditionally factorized models, such as the …