An investigation of deep neural networks for noise robust speech recognition ML Seltzer, D Yu, Y Wang 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 804 | 2013 |
CNTK: Microsoft's open-source deep-learning toolkit F Seide, A Agarwal Proceedings of the 22nd ACM SIGKDD international conference on knowledge …, 2016 | 637 | 2016 |
Towards end-to-end spoken language understanding D Serdyuk, Y Wang, C Fuegen, A Kumar, B Liu, Y Bengio 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 263 | 2018 |
Transformer-based acoustic modeling for hybrid speech recognition Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 255 | 2020 |
Google usm: Scaling automatic speech recognition beyond 100 languages Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ... arXiv preprint arXiv:2303.01037, 2023 | 200 | 2023 |
Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022 | 171 | 2022 |
Transformer-transducer: End-to-end speech recognition with self-attention CF Yeh, J Mahadeokar, K Kalgaonkar, Y Wang, D Le, M Jain, K Schubert, ... arXiv preprint arXiv:1910.12977, 2019 | 170 | 2019 |
Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition Y Shi, Y Wang, C Wu, CF Yeh, J Chan, F Zhang, D Le, M Seltzer ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 166 | 2021 |
Efficient lattice rescoring using recurrent neural network language models X Liu, Y Wang, X Chen, MJF Gales, PC Woodland 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 113 | 2014 |
Audiopalm: A large language model that can speak and listen PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... arXiv preprint arXiv:2306.12925, 2023 | 111 | 2023 |
Simplifying long short-term memory acoustic models for fast training and decoding Y Miao, J Li, Y Wang, SX Zhang, Y Gong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 100 | 2016 |
Adaptation of deep neural network acoustic models using factorised i-vectors. P Karanasou, Y Wang, MJF Gales, PC Woodland Interspeech 2014, 2180-2184, 2014 | 83 | 2014 |
Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch. X Chen, Y Wang, X Liu, MJF Gales, PC Woodland Interspeech 14, 641-645, 2014 | 82 | 2014 |
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory C Wu, Y Wang, Y Shi, CF Yeh, F Zhang arXiv preprint arXiv:2005.08042, 2020 | 67 | 2020 |
Speaker and noise factorization for robust speech recognition Y Wang, MJF Gales IEEE Transactions on Audio, Speech, and Language Processing 20 (7), 2149-2158, 2012 | 61 | 2012 |
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR. Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen Interspeech, 3490-3494, 2019 | 56 | 2019 |
End-to-end contextual speech recognition using class language models and a token passing decoder Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 54 | 2019 |
Investigations on speaker adaptation of LSTM RNN models for speech recognition C Liu, Y Wang, K Kumar, Y Gong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 54 | 2016 |
Deja-vu: Double feature presentation and iterated loss in deep transformer networks A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 48 | 2020 |
Small-footprint high-performance deep neural network-based speech recognition using split-VQ Y Wang, J Li, Y Gong 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 48 | 2015 |