A review of speaker diarization: Recent advances with deep learning TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan Computer Speech & Language 72, 101317, 2022 | 309 | 2022 |
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion M Li, KJ Han, S Narayanan Computer Speech & Language 27 (1), 151-167, 2013 | 228 | 2013 |
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap TJ Park, KJ Han, M Kumar, S Narayanan IEEE Signal Processing Letters 27, 381-385, 2019 | 121 | 2019 |
The CAPIO 2017 conversational speech recognition system KJ Han, A Chandrashekaran, J Kim, I Lane arXiv preprint arXiv:1801.00059, 2017 | 89 | 2017 |
Strategies to improve the robustness of agglomerative hierarchical clustering under data source variation for speaker diarization KJ Han, S Kim, SS Narayanan IEEE Transactions on Audio, Speech, and Language Processing 16 (8), 1590-1601, 2008 | 79 | 2008 |
State-of-the-art speech recognition using multi-stream self-attention with dilated 1d convolutions KJ Han, R Prieto, T Ma 2019 IEEE Automatic speech recognition and understanding workshop (ASRU), 54-61, 2019 | 76 | 2019 |
E-branchformer: Branchformer with enhanced merging for speech recognition K Kim, F Wu, Y Peng, J Pan, P Sridhar, KJ Han, S Watanabe 2022 IEEE Spoken Language Technology Workshop (SLT), 84-91, 2023 | 67 | 2023 |
Robust language identification using convolutional neural network features. S Ganapathy, KJ Han, S Thomas, MK Omar, M Van Segbroeck, ... Interspeech, 1846-1850, 2014 | 67 | 2014 |
A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system. KJ Han, SS Narayanan Interspeech, 1853-1856, 2007 | 57 | 2007 |
Slue: New benchmark tasks for spoken language understanding evaluation on natural speech S Shon, A Pasad, F Wu, P Brusco, Y Artzi, K Livescu, KJ Han ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 56 | 2022 |
Combining five acoustic level modeling methods for automatic speaker age and gender recognition. M Li, CS Jung, KJ Han INTERSPEECH, 2826-2829, 2010 | 46 | 2010 |
Multistream CNN for robust acoustic modeling KJ Han, J Pan, VKN Tadala, T Ma, D Povey ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 43 | 2021 |
Deep Learning-Based Telephony Speech Recognition in the Wild KJ Han, S Hahm, BH Kim, J Kim, IR Lane INTERSPEECH, 1323-1327, 2017 | 38 | 2017 |
Performance-efficiency trade-offs in unsupervised pre-training for speech recognition F Wu, K Kim, J Pan, KJ Han, KQ Weinberger, Y Artzi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 37 | 2022 |
Speaker diarization with lexical information TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan arXiv preprint arXiv:2004.06756, 2020 | 36 | 2020 |
ASAPP-ASR: Multistream CNN and self-attentive SRU for SOTA speech recognition J Pan, J Shapiro, J Wohlwend, KJ Han, T Lei, T Ma arXiv preprint arXiv:2005.10469, 2020 | 33 | 2020 |
Identifying a driver of a vehicle SV Myers, S Elwart, WJ Talamonti, JT Mullen, ZD Nelson, T Smith, ... US Patent 9,707,911, 2017 | 32 | 2017 |
Wav2seq: Pre-training speech-to-text encoder-decoder models using pseudo languages F Wu, K Kim, S Watanabe, KJ Han, R McDonald, KQ Weinberger, Y Artzi ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 28 | 2023 |
Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling. KJ Han, SS Narayanan Interspeech, 20-23, 2008 | 28 | 2008 |
Novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering KJ Han, SS Narayanan 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 22 | 2008 |