CNN architectures for large-scale audio classification S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ... 2017 ieee international conference on acoustics, speech and signal …, 2017 | 2884 | 2017 |
Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation A Ephrat, I Mosseri, O Lang, T Dekel, K Wilson, A Hassidim, WT Freeman, ... arXiv preprint arXiv:1804.03619, 2018 | 857 | 2018 |
Learning the speech front-end with raw waveform CLDNNs. TN Sainath, RJ Weiss, AW Senior, KW Wilson, O Vinyals Interspeech, 1-5, 2015 | 614 | 2015 |
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018 | 400 | 2018 |
Speech denoising using nonnegative matrix factorization with priors KW Wilson, B Raj, P Smaragdis, A Divakaran 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 367 | 2008 |
Speech acoustic modeling from raw multichannel waveforms Y Hoshen, RJ Weiss, KW Wilson 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 280 | 2015 |
Multichannel signal processing with deep neural networks for automatic speech recognition TN Sainath, RJ Weiss, KW Wilson, B Li, A Narayanan, E Variani, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (5), 965-979, 2017 | 265 | 2017 |
Processing multi-channel audio waveforms TN Sainath, RJ Weiss, KW Wilson, AW Senior, A Narayanan, Y Hoshen, ... US Patent 9,697,826, 2017 | 242 | 2017 |
Universal sound separation I Kavalerov, S Wisdom, H Erdogan, B Patton, K Wilson, J Le Roux, ... 2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019 | 206 | 2019 |
Acoustic Modeling for Google Home. B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ... Interspeech, 399-403, 2017 | 202 | 2017 |
Unsupervised sound separation using mixture invariant training S Wisdom, E Tzinis, H Erdogan, R Weiss, K Wilson, J Hershey Advances in neural information processing systems 33, 3846-3857, 2020 | 179 | 2020 |
Neural network adaptive beamforming for robust multichannel speech recognition. B Li, TN Sainath, RJ Weiss, KW Wilson, M Bacchiani Interspeech, 1976-1980, 2016 | 145 | 2016 |
Regularized non-negative matrix factorization with temporal dependencies for speech denoising. KW Wilson, B Raj, P Smaragdis Interspeech, 411-414, 2008 | 130 | 2008 |
Visual speech recognition with loosely synchronized feature streams K Saenko, K Livescu, M Siracusa, K Wilson, J Glass, T Darrell Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 2 …, 2005 | 119 | 2005 |
Differentiable consistency constraints for improved deep speech enhancement S Wisdom, JR Hershey, K Wilson, J Thorpe, M Chinen, B Patton, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 114 | 2019 |
Low latency video storyboard delivery with selectable resolution levels NO Krahnstoever, KW Wilson US Patent App. 13/785,913, 2014 | 110 | 2014 |
Multiple person and speaker activity tracking with a particle filter N Checka, KW Wilson, MR Siracusa, T Darrell 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 99 | 2004 |
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani, A Senior 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 94 | 2015 |
VoiceFilter-Lite: Streaming targeted voice separation for on-device speech recognition Q Wang, IL Moreno, M Saglam, K Wilson, A Chiao, R Liu, Y He, W Li, ... arXiv preprint arXiv:2009.04323, 2020 | 92 | 2020 |
Factored spatial and spectral multichannel raw waveform CLDNNs TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 90 | 2016 |