Audio-visual recognition of overlapped speech for the LRS2 dataset J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 99 | 2020 |
Investigation of data augmentation techniques for disordered speech recognition M Geng, X Xie, S Liu, J Yu, S Hu, X Liu, H Meng arXiv preprint arXiv:2201.05562, 2022 | 60 | 2022 |
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng Interspeech, 2938-2942, 2018 | 55 | 2018 |
Recent progress in the CUHK dysarthric speech recognition system S Liu, M Geng, S Hu, X Xie, M Cui, J Yu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2267-2281, 2021 | 53 | 2021 |
Adversarial data augmentation for disordered speech recognition Z Jin, M Geng, X Xie, J Yu, S Liu, X Liu, H Meng arXiv preprint arXiv:2108.00899, 2021 | 38 | 2021 |
Development of the CUHK elderly speech recognition system for neurocognitive disorder detection using the DementiaBank corpus Z Ye, S Hu, J Li, X Xie, M Geng, J Yu, J Xu, B Xue, S Liu, X Liu, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 32 | 2021 |
Neural architecture search for LF-MMI trained time delay neural networks S Hu, X Xie, M Cui, J Deng, S Liu, J Yu, M Geng, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1093-1107, 2022 | 28 | 2022 |
Bayesian transformer language models for speech recognition B Xue, J Yu, J Xu, S Liu, S Hu, Z Ye, M Geng, X Liu, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 25 | 2021 |
Audio-visual multi-channel integration and recognition of overlapped speech J Yu, SX Zhang, B Wu, S Liu, S Hu, M Geng, X Liu, H Meng, D Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2067-2082, 2021 | 24 | 2021 |
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition. S Liu, X Xie, J Yu, S Hu, M Geng, R Su, SX Zhang, X Liu, H Meng Interspeech, 711-715, 2020 | 21 | 2020 |
Spectro-temporal deep features for disordered speech assessment and recognition M Geng, S Liu, J Yu, X Xie, S Hu, Z Ye, Z Jin, X Liu, H Meng arXiv preprint arXiv:2201.05554, 2022 | 20 | 2022 |
The CUHK Dysarthric Speech Recognition Systems for English and Cantonese. S Hu, S Liu, HF Chang, M Geng, J Chen, LW Chung, TK Hei, J Yu, ... Interspeech, 3669-3670, 2019 | 19 | 2019 |
Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition. S Liu, S Hu, Y Wang, J Yu, R Su, X Liu, H Meng Interspeech (Best Student Paper Nomination), 4120-4124, 2019 | 19 | 2019 |
Music understanding LLaMA: Advancing text-to-music generation with question answering and captioning S Liu, AS Hussain, C Sun, Y Shan ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 18 | 2024 |
On the Use of Pitch Features for Disordered Speech Recognition. S Liu, S Hu, X Liu, H Meng Interspeech, 4130-4134, 2019 | 17 | 2019 |
Bayesian learning of LF-MMI trained time delay neural networks for speech recognition S Hu, X Xie, S Liu, J Yu, Z Ye, M Geng, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1514-1529, 2021 | 16 | 2021 |
Bayesian and Gaussian process neural networks for large vocabulary continuous speech recognition S Hu, MWY Lam, X Xie, S Liu, J Yu, X Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 15 | 2019 |
Limited-memory BFGS optimization of recurrent neural network language models for speech recognition X Liu, S Liu, J Sha, J Yu, Z Xu, X Chen, H Meng 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 15 | 2018 |
Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition S Hu, S Liu, X Xie, M Geng, T Wang, S Hu, M Cui, X Liu, H Meng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 14 | 2022 |
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition. J Deng, FR Gutierrez, S Hu, M Geng, X Xie, Z Ye, S Liu, J Yu, X Liu, ... Interspeech, 4818-4822, 2021 | 14 | 2021 |