关注
Kenichi Kumatani
Kenichi Kumatani
Amazon
在 ieee.org 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors
K Kumatani, J McDonough, B Raj
IEEE Signal Processing Magazine 29 (6), 127-140, 2012
1622012
Generation of wake-up words
OA Bapat, K Kumatani
US Patent 9,373,321, 2016
1242016
Unispeech: Unified speech representation learning with labeled and unlabeled data
C Wang, Y Wu, Y Qian, K Kumatani, S Liu, F Wei, M Zeng, X Huang
International Conference on Machine Learning, 10937-10947, 2021
1162021
Adaptive beamforming with a maximum negentropy criterion
K Kumatani, J McDonough, D Klakow, PN Garner, W Li
Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008, 180-183, 2008
91*2008
Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
L Mošner, M Wu, A Raju, SHK Parthasarathi, K Kumatani, S Sundaram, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
712019
Microphone array processing for distant speech recognition: Towards real-world deployment
K Kumatani, T Arakawa, K Yamamoto, J McDonough, B Raj, R Singh, ...
Proceedings of The 2012 Asia Pacific Signal and Information Processing …, 2012
642012
Channel selection based on multichannel cross-correlation coefficients for distant speech recognition
K Kumatani, J McDonough, JF Lehman, B Raj
2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays …, 2011
632011
Direct modeling of raw audio with dnns for wake word detection
K Kumatani, S Panchapagesan, M Wu, M Kim, N Strom, G Tiwari, ...
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
582017
Adaptive beamforming with a minimum mutual information criterion
K Kumatani, T Gehrig, U Mayer, E Stoimenov, J McDonough, M Wolfel
Audio, Speech, and Language Processing, IEEE Transactions on 15 (8), 2527-2541, 2007
52*2007
Frequency domain multi-channel acoustic modeling for distant speech recognition
W Minhua, K Kumatani, S Sundaram, N Ström, B Hoffmeister
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
512019
Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming
K Kumatani, J McDonough, S Schacht, D Klakow, PN Garner, W Li
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
482008
Time-delayed bottleneck highway networks using a DFT feature for keyword spotting
J Guo, K Kumatani, M Sun, M Wu, A Raju, N Ström, A Mandal
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
452018
A Federated Approach in Training Acoustic Models.
D Dimitriadis, RG Ken'ichi Kumatani, R Gmyr, Y Gaur, SE Eskimez
Interspeech, 981-985, 2020
432020
To separate speech: A system for recognizing simultaneous speech
J McDonough, K Kumatani, T Gehrig, E Stoimenov, U Mayer, S Schacht, ...
Proceedings of the 4th international conference on Machine learning for …, 2007
332007
Advances in lecture recognition: The isl rt-06s evaluation system
C Fügen, M Wölfel, JW McDonough, S Ikbal, F Kraft, K Laskowski, ...
Ninth International Conference on Spoken Language Processing, 2006
302006
Trigger word detection using neural network waveform processing
A Mandal, N Strom, K Kumatani, S Panchapagesan
US Patent 10,847,137, 2020
292020
Improving hands-free speech recognition in a car through audio-visual voice activity detection
F Faubel, M Georges, K Kumatani, A Bruhn, D Klakow
2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays …, 2011
252011
Multi-geometry spatial acoustic modeling for distant speech recognition
K Kumatani, W Minhua, S Sundaram, N Ström, B Hoffmeister
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
222019
Maximum kurtosis beamforming with the generalized sidelobe canceller
K Kumatani, J McDonough, B Rauch, PN Garner, W Li, J Dines
Ninth Annual Conference of the International Speech Communication Association, 2008
212008
Multi-modal temporal asynchronicity modeling by product HMMs for robust audio-visual speech recognition
S Nakamura, K Kumatani, S Tamura
Proceedings. Fourth IEEE International Conference on Multimodal Interfaces …, 2002
202002
系统目前无法执行此操作,请稍后再试。
文章 1–20