Wavlm: Large-scale self-supervised pre-training for full stack speech processing S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022 | 1176 | 2022 |
Recent advances in deep learning for speech research at Microsoft L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ... Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International …, 2013 | 1030 | 2013 |
Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers JT Huang, J Li, D Yu, L Deng, Y Gong 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 770 | 2013 |
An overview of noise-robust automatic speech recognition J Li, L Deng, Y Gong, R Haeb-Umbach IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (4), 745-777, 2014 | 665 | 2014 |
Restructuring of deep neural network acoustic models with singular value decomposition. J Xue, J Li, Y Gong INTERSPEECH, 2365-2369, 2013 | 509 | 2013 |
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2301.02111, 2023 | 371 | 2023 |
Learning small-size DNN with output-distribution-based criteria. J Li, R Zhao, JT Huang, Y Gong INTERSPEECH, 1910-1914, 2014 | 347 | 2014 |
Recent advances in end-to-end automatic speech recognition J Li APSIPA Transactions on Signal and Information Processing 11 (1), 2022 | 320 | 2022 |
Feature learning in deep neural networks-studies on speech recognition tasks D Yu, ML Seltzer, J Li, JT Huang, F Seide arXiv preprint arXiv:1301.3605, 2013 | 318 | 2013 |
Restructuring deep neural network acoustic models J Xue, E Stoimenov, J Li, Y Gong US Patent 9,728,184, 2017 | 249 | 2017 |
Restructuring deep neural network acoustic models J Xue, E Stoimenov, J Li, Y Gong US Patent 9,728,184, 2017 | 249 | 2017 |
Shared hidden layer combination for speech recognition systems J Li, J Xue, Y Gong US Patent 9,520,127, 2016 | 230 | 2016 |
Shared hidden layer combination for speech recognition systems J Li, J Xue, Y Gong US Patent 9,520,127, 2016 | 230 | 2016 |
Variable-component deep neural network for robust speech recognition J Li, R Zhao, Y Gong US Patent 10,019,990, 2018 | 216 | 2018 |
Continuous speech separation: dataset and analysis Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 205 | 2020 |
End-to-End attention based text-dependent speaker verification SX Zhang, Z Chen, Y Zhao, J Li, Y Gong Spoken Language Technology Workshop (SLT), 2016 IEEE, 171-178, 2016 | 204 | 2016 |
Robust Automatic Speech Recognition: A Bridge to Practical Applications J Li, L Deng, R Haeb-Umbach, Y Gong Academic Press, 2015 | 200 | 2015 |
Improving RNN Transducer Modeling for End-to-End Speech Recognition J Li, R Zhao, H Hu, Y Gong 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2019 | 195 | 2019 |
Singular Value Decomposition Based Low-footprint Speaker Adaptation and Personalization for Deep Neural Network J Xue, J Li, D Yu, M Seltzer, Y Gong ICASSP, 2014 | 195 | 2014 |
Recent progresses in deep learning based acoustic models D Yu, J Li IEEE/CAA Journal of Automatica Sinica 4 (3), 396-409, 2017 | 190 | 2017 |