A spelling correction model for end-to-end speech recognition J Guo, TN Sainath, RJ Weiss ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 151 | 2019 |
Effectiveness of voice quality features in detecting depression A Afshan, J Guo, SJ Park, V Ravi, J Flint, A Alwan Interspeech 2018, 2018 | 69 | 2018 |
Attention based CLDNNs for short-duration acoustic scene classification J Guo, N Xu, LJ Li, A Alwan Interspeech 2017, 2017 | 57 | 2017 |
Prompting large language models with speech recognition abilities Y Fathullah, C Wu, E Lakomkin, J Jia, Y Shangguan, K Li, J Guo, W Xiong, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 55 | 2024 |
Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition J Guo, G Tiwari, J Droppo, M Van Segbroeck, CW Huang, A Stolcke, ... arXiv preprint arXiv:2007.13802, 2020 | 53 | 2020 |
Time-delayed bottleneck Highway Networks using a DFT feature for keyword spotting J Guo, K Kumatani, M Sun, M Wu, A Raju, N Strom, A Mandal ICASSP 2018, 2018 | 45 | 2018 |
Singing voice conversion with non-parallel data X Chen, W Chu, J Guo, N Xu 2019 IEEE Conference on Multimedia Information Processing and Retrieval …, 2019 | 39 | 2019 |
Speaker Identity and Voice Quality: Modeling Human Responses and Automatic Speaker Recognition. SJ Park, C Sigouin, J Kreiman, PA Keating, J Guo, G Yeung, FY Kuo, ... Interspeech, 1044-1048, 2016 | 32 | 2016 |
REDAT: Accent-invariant representation for end-to-end ASR by domain adversarial training with relabeling H Hu, X Yang, Z Raeesy, J Guo, G Keskin, H Arsikere, A Rastrow, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 27 | 2021 |
Deep neural network based i-vector mapping for speaker verification using short utterances J Guo, N Xu, K Qian, Y Shi, K Xu, Y Wu, A Alwan Speech Communication 105, 92-102, 2018 | 27 | 2018 |
Robust speaker identification via fusion of subglottal resonances and cepstral features J Guo, R Yang, H Arsikere, A Alwan The Journal of the Acoustical Society of America 141 (4), EL420-EL426, 2017 | 16 | 2017 |
CNN-Based Joint Mapping of Short and Long Utterance i-Vectors for Speaker Verification Using Short Utterances. J Guo, UA Nookala, A Alwan INTERSPEECH, 3712-3716, 2017 | 15 | 2017 |
Speaker Verification Using Short Utterances with DNN-Based Estimation of Subglottal Acoustic Features. J Guo, G Yeung, D Muralidharan, H Arsikere, A Afshan, A Alwan INTERSPEECH, 2219-2222, 2016 | 15 | 2016 |
Age-dependent height estimation and speaker normalization for children's speech using the first three subglottal resonances. J Guo, R Paturi, G Yeung, SM Lulich, H Arsikere, A Alwan INTERSPEECH, 1665-1669, 2015 | 13 | 2015 |
Subglottal resonances of American English speaking children G Yeung, SM Lulich, J Guo, MS Sommers, A Alwan The Journal of the Acoustical Society of America 144 (6), 3437-3449, 2018 | 9 | 2018 |
Filter sampling and combination CNN (FSC-CNN): a compact CNN model for small-footprint ASR acoustic modeling using raw waveforms J Guo, N Xu, X Chen, Y Shi, K Xu, A Alwan INTERSPEECH 2018, 2018 | 8 | 2018 |
Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification A Afshan, J Guo, SJ Park, V Ravi, A McCree, A Alwan arXiv preprint arXiv:2008.03616, 2020 | 7 | 2020 |
Improving fast-slow encoder based transducer with streaming deliberation K Li, J Mahadeokar, J Guo, Y Shi, G Keren, O Kalinli, ML Seltzer, D Le ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 5 | 2023 |
Biased self-supervised learning for ASR FL Kreyssig, Y Shi, J Guo, L Sari, A Mohamed, PC Woodland arXiv preprint arXiv:2211.02536, 2022 | 4 | 2022 |
Acoustic neural network scene detection J Guo, J Li, N Xu US Patent 10,878,837, 2020 | 4 | 2020 |