Improving deep neural network acoustic models using generalized maxout networks X Zhang, J Trmal, D Povey, S Khudanpur 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 401 | 2014 |
Parallel training of deep neural networks with natural gradient and parameter averaging D Povey, X Zhang, S Khudanpur arXiv preprint arXiv:1410.7455, 124, 2014 | 396 | 2014 |
Transformer-based acoustic modeling for hybrid speech recognition Y Wang, A Mohamed, D Le, C Liu, A Xiao, J Mahadeokar, H Huang, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 243 | 2020 |
Scaling speech technology to 1,000+ languages V Pratap, A Tjandra, B Shi, P Tomasello, A Babu, S Kundu, A Elkahky, ... Journal of Machine Learning Research (JMLR), 2024 | 124 | 2024 |
Improving Speaker Recognition Performance in the Domain Adaptation Challenge using Deep Neural Networks D Garcia-Romero, X Zhang, A McCree, D Povey Proc. SLT, 2014 | 108 | 2014 |
From senones to chenones: Tied context-dependent graphemes for hybrid speech recognition D Le, X Zhang, W Zheng, C Fügen, G Zweig, ML Seltzer 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 65 | 2019 |
A KEYWORD SEARCH SYSTEM USING OPEN SOURCE SOFTWARE J Trmal, G Chen, D Povey, S Khudanpur, P Ghahremani, X Zhang, ... Proc. SLT, 2014 | 50 | 2014 |
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ... Interspeech, 3597-3601, 2017 | 47 | 2017 |
Deja-vu: Double feature presentation and iterated loss in deep transformer networks A Tjandra, C Liu, F Zhang, X Zhang, Y Wang, G Synnaeve, S Nakamura, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 44 | 2020 |
Backstitch: Counteracting Finite-Sample Bias via Negative Steps. Y Wang, V Peddinti, H Xu, X Zhang, D Povey, S Khudanpur Interspeech, 1631-1635, 2017 | 32 | 2017 |
Towards measuring fairness in speech recognition: Casual conversations dataset transcriptions C Liu, M Picheny, L Sarı, P Chitkara, A Xiao, X Zhang, M Chou, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 31 | 2022 |
Faster, simpler and more accurate hybrid asr systems using wordpieces F Zhang, Y Wang, X Zhang, C Liu, Y Saraf, G Zweig Interspeech 2020, 2020 | 29 | 2020 |
Multilingual graphemic hybrid ASR with massive data augmentation C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig arXiv preprint arXiv:1909.06522, 2019 | 27 | 2019 |
A diversity-penalizing ensemble training method for deep learning X Zhang, D Povey, S Khudanpur Sixteenth Annual Conference of the International Speech Communication …, 2015 | 26 | 2015 |
Benchmarking lf-mmi, ctc and rnn-t criteria for streaming asr X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ... 2021 IEEE spoken language technology workshop (SLT), 46-51, 2021 | 22 | 2021 |
Accent-robust automatic speech recognition using supervised and unsupervised wav2vec embeddings J Li, V Manohar, P Chitkara, A Tjandra, M Picheny, F Zhang, X Zhang, ... arXiv preprint arXiv:2110.03520, 2021 | 18 | 2021 |
Torchaudio-squim: Reference-less speech quality and intelligibility measures in torchaudio A Kumar, K Tan, Z Ni, P Manocha, X Zhang, E Henderson, B Xu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 17 | 2023 |
Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework X Zhang, V Manohar, D Povey, S Khudanpur Interspeech 2017, 2017 | 13 | 2017 |
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models X Zhang, V Manohar, D Zhang, F Zhang, Y Shi, N Singhal, J Chan, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 11 | 2021 |
Omni-sparsity dnn: Fast sparsity optimization for on-device streaming e2e asr via supernet H Yang, Y Shangguan, D Wang, M Li, P Chuang, X Zhang, G Venkatesh, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |