Explore wav2vec 2.0 for Mispronunciation Detection X Xu, Y Kang, S Cao, B Lin, L Ma Proc. Interspeech 2021, 4428-4432, 2021 | 62 | 2021 |
Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-supervised Learning K Deng, S Cao, L Ma Proc. Interspeech 2021, 1504-1508, 2021 | 30 | 2021 |
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model K Deng, S Cao, Y Zhang, L Ma 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 76-82, 2021 | 29 | 2021 |
Improving CTC-based speech recognition via knowledge transferring from pre-trained language models K Deng, S Cao, Y Zhang, L Ma, G Cheng, J Xu, P Zhang ICASSP 2022, 2022 | 26 | 2022 |
Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning S Cao, Y Kang, Y Fu, X Xu, S Sun, Y Zhang, L Ma Proc. Interspeech 2021, 706-710, 2021 | 15 | 2021 |
Multi-head monotonic chunkwise attention for online speech recognition B Liu, S Cao, S Sun, W Zhang, L Ma arXiv preprint arXiv:2005.00205, 2020 | 9 | 2020 |
Improving speech recognition accuracy of local poi using geographical models S Cao, Y Zhang, X Feng, L Ma 2021 IEEE Spoken Language Technology Workshop (SLT), 180-185, 2021 | 5 | 2021 |
DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model Y Fu, Y Kang, S Cao, L Ma arXiv preprint arXiv:2303.09278, 2023 | 4 | 2023 |
Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training B Zhang, S Cao, X Zhang, Y Zhang, L Ma, T Shinozaki Interspeech 2022, 2022 | 2 | 2022 |
A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling Y Zhang, X Feng, Y Liu, S Cao, L Ma arXiv preprint arXiv:2203.04767, 2022 | | 2022 |