Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors K Kumatani, J McDonough, B Raj IEEE Signal Processing Magazine 29 (6), 127-140, 2012 | 162 | 2012 |
Generation of wake-up words OA Bapat, K Kumatani US Patent 9,373,321, 2016 | 124 | 2016 |
Unispeech: Unified speech representation learning with labeled and unlabeled data C Wang, Y Wu, Y Qian, K Kumatani, S Liu, F Wei, M Zeng, X Huang International Conference on Machine Learning, 10937-10947, 2021 | 116 | 2021 |
Adaptive beamforming with a maximum negentropy criterion K Kumatani, J McDonough, D Klakow, PN Garner, W Li Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008, 180-183, 2008 | 91* | 2008 |
Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning L Mošner, M Wu, A Raju, SHK Parthasarathi, K Kumatani, S Sundaram, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 71 | 2019 |
Microphone array processing for distant speech recognition: Towards real-world deployment K Kumatani, T Arakawa, K Yamamoto, J McDonough, B Raj, R Singh, ... Proceedings of The 2012 Asia Pacific Signal and Information Processing …, 2012 | 64 | 2012 |
Channel selection based on multichannel cross-correlation coefficients for distant speech recognition K Kumatani, J McDonough, JF Lehman, B Raj 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays …, 2011 | 63 | 2011 |
Direct modeling of raw audio with dnns for wake word detection K Kumatani, S Panchapagesan, M Wu, M Kim, N Strom, G Tiwari, ... 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 58 | 2017 |
Adaptive beamforming with a minimum mutual information criterion K Kumatani, T Gehrig, U Mayer, E Stoimenov, J McDonough, M Wolfel Audio, Speech, and Language Processing, IEEE Transactions on 15 (8), 2527-2541, 2007 | 52* | 2007 |
Frequency domain multi-channel acoustic modeling for distant speech recognition W Minhua, K Kumatani, S Sundaram, N Ström, B Hoffmeister ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 51 | 2019 |
Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming K Kumatani, J McDonough, S Schacht, D Klakow, PN Garner, W Li 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 48 | 2008 |
Time-delayed bottleneck highway networks using a DFT feature for keyword spotting J Guo, K Kumatani, M Sun, M Wu, A Raju, N Ström, A Mandal 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 45 | 2018 |
A Federated Approach in Training Acoustic Models. D Dimitriadis, RG Ken'ichi Kumatani, R Gmyr, Y Gaur, SE Eskimez Interspeech, 981-985, 2020 | 43 | 2020 |
To separate speech: A system for recognizing simultaneous speech J McDonough, K Kumatani, T Gehrig, E Stoimenov, U Mayer, S Schacht, ... Proceedings of the 4th international conference on Machine learning for …, 2007 | 33 | 2007 |
Advances in lecture recognition: The isl rt-06s evaluation system C Fügen, M Wölfel, JW McDonough, S Ikbal, F Kraft, K Laskowski, ... Ninth International Conference on Spoken Language Processing, 2006 | 30 | 2006 |
Trigger word detection using neural network waveform processing A Mandal, N Strom, K Kumatani, S Panchapagesan US Patent 10,847,137, 2020 | 29 | 2020 |
Improving hands-free speech recognition in a car through audio-visual voice activity detection F Faubel, M Georges, K Kumatani, A Bruhn, D Klakow 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays …, 2011 | 25 | 2011 |
Multi-geometry spatial acoustic modeling for distant speech recognition K Kumatani, W Minhua, S Sundaram, N Ström, B Hoffmeister ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 22 | 2019 |
Maximum kurtosis beamforming with the generalized sidelobe canceller K Kumatani, J McDonough, B Rauch, PN Garner, W Li, J Dines Ninth Annual Conference of the International Speech Communication Association, 2008 | 21 | 2008 |
Multi-modal temporal asynchronicity modeling by product HMMs for robust audio-visual speech recognition S Nakamura, K Kumatani, S Tamura Proceedings. Fourth IEEE International Conference on Multimodal Interfaces …, 2002 | 20 | 2002 |