The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech K Kinoshita, M Delcroix, T Yoshioka, T Nakatani, E Habets, ... 2013 IEEE Workshop on Applications of Signal Processing to Audio and …, 2013 | 457 | 2013 |
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research K Kinoshita, M Delcroix, S Gannot, EAP Habets, R Haeb-Umbach, ... EURASIP Journal on Advances in Signal Processing 2016, 1-19, 2016 | 405 | 2016 |
Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition T Yoshioka, A Sehr, M Delcroix, K Kinoshita, R Maas, T Nakatani, ... IEEE Signal Processing Magazine 29 (6), 114-126, 2012 | 330 | 2012 |
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ... 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 261 | 2015 |
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration T Nakatani Proc. INTERSPEECH 2019, 1408-1412, 2019 | 252 | 2019 |
Suppression of late reverberation effect on speech signal using long-term multiple-step linear prediction K Kinoshita, M Delcroix, T Nakatani, M Miyoshi IEEE Transactions on Audio, Speech, and Language Processing 17 (4), 534-545, 2009 | 252 | 2009 |
SpeakerBeam: Speaker aware neural network for target speaker extraction in speech mixtures K Žmolíková, M Delcroix, K Kinoshita, T Ochiai, T Nakatani, L Burget, ... IEEE Journal of Selected Topics in Signal Processing 13 (4), 800-814, 2019 | 200 | 2019 |
Single channel target speaker extraction and recognition with speaker beam M Delcroix, K Zmolikova, K Kinoshita, A Ogawa, T Nakatani 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 196 | 2018 |
Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 188 | 2008 |
Speech enhancement using self-adaptation and multi-head self-attention Y Koizumi, K Yatabe, M Delcroix, Y Masuyama, D Takeuchi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 139 | 2020 |
Exploring multi-channel features for denoising-autoencoder-based speech enhancement S Araki, T Hayashi, M Delcroix, M Fujimoto, K Takeda, T Nakatani 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 130 | 2015 |
Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB challenge M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ... Proceedings of REVERB Workshop 2014, 2014 | 126 | 2014 |
Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR T Higuchi, N Ito, S Araki, T Yoshioka, M Delcroix, T Nakatani IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (4), 780-793, 2017 | 124 | 2017 |
Speaker-aware neural network based beamformer for speaker extraction in speech mixtures K Žmolíková, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani Interspeech 2017, 2017 | 119 | 2017 |
Precise dereverberation using multichannel linear prediction M Delcroix, T Hikichi, M Miyoshi IEEE Transactions on Audio, Speech, and Language Processing 15 (2), 430-440, 2007 | 114 | 2007 |
Neural Network-Based Spectrum Estimation for Online WPE Dereverberation K Kinoshita, M Delcroix, H Kwon, T Mori, T Nakatani Interspeech, 384-388, 2017 | 111 | 2017 |
Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations T Hikichi, M Delcroix, M Miyoshi EURASIP Journal on Advances in Signal Processing 2007, 1-12, 2007 | 107 | 2007 |
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam M Delcroix, T Ochiai, K Zmolikova, K Kinoshita, N Tawara, T Nakatani, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 106 | 2020 |
All-neural online source separation, counting, and diarization for meeting analysis T Von Neumann, K Kinoshita, M Delcroix, S Araki, T Nakatani, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 104 | 2019 |
Far-field automatic speech recognition R Haeb-Umbach, J Heymann, L Drude, S Watanabe, M Delcroix, ... Proceedings of the IEEE 109 (2), 124-148, 2020 | 101 | 2020 |