The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech S Araki, R Mukai, S Makino, T Nishikawa, H Saruwatari IEEE Transactions on Speech and Audio Processing 11 (2), 109-116, 2003 | 494 | 2003 |
Determined blind source separation unifying independent vector analysis and nonnegative matrix factorization D Kitamura, N Ono, H Sawada, H Kameoka, H Saruwatari IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (9), 1626 …, 2016 | 408 | 2016 |
Blind source separation combining independent component analysis and beamforming H Saruwatari, S Kurita, K Takeda, F Itakura, T Nishikawa, K Shikano EURASIP Journal on Advances in Signal Processing 2003, 1-12, 2003 | 307 | 2003 |
Evaluation of blind signal separation method using directivity pattern under reverberant conditions S Kurita, H Saruwatari, S Kajita, K Takeda, F Itakura 2000 IEEE International Conference on Acoustics, Speech, and Signal …, 2000 | 287 | 2000 |
Statistical parametric speech synthesis incorporating generative adversarial networks Y Saito, S Takamichi, H Saruwatari IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (1), 84-96, 2017 | 264 | 2017 |
Blind source separation based on a fast-convergence algorithm combining ICA and beamforming H Saruwatari, T Kawamura, T Nishikawa, A Lee, K Shikano IEEE Transactions on Audio, speech, and language processing 14 (2), 666-678, 2006 | 262 | 2006 |
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum T Toda, H Saruwatari, K Shikano 2001 IEEE international conference on acoustics, speech, and signal …, 2001 | 231 | 2001 |
Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech K Nakamura, T Toda, H Saruwatari, K Shikano Speech communication 54 (1), 134-146, 2012 | 219 | 2012 |
Blind spatial subtraction array for speech enhancement in noisy environment Y Takahashi, T Takatani, K Osako, H Saruwatari, K Shikano IEEE transactions on audio, speech, and language processing 17 (4), 650-664, 2009 | 158 | 2009 |
JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis R Sonobe, S Takamichi, H Saruwatari arXiv preprint arXiv:1711.00354, 2017 | 149 | 2017 |
Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation Y Ohtani, T Toda, H Saruwatari, K Shikano | 142 | 2006 |
GMM-based voice conversion applied to emotional speech synthesis H Kawanami, Y Iwami, T Toda, H Saruwatari, K Shikano | 128 | 2003 |
実環境研究プラットホームとしての音声情報案内システムの運用 西村竜一, 西原洋平, 鶴身玲典, 李晃伸, 猿渡洋, 鹿野清宏 電子情報通信学会論文誌 D 87 (3), 789-798, 2004 | 122* | 2004 |
A review of blind source separation methods: two converging routes to ILRMA originating from ICA and NMF H Sawada, N Ono, H Kameoka, D Kitamura, H Saruwatari APSIPA Transactions on Signal and Information Processing 8, e12, 2019 | 119 | 2019 |
Equivalence between frequency-domain blind source separation and frequency-domain adaptive beamforming for convolutive mixtures S Araki, S Makino, Y Hinamoto, R Mukai, T Nishikawa, H Saruwatari EURASIP Journal on Advances in Signal Processing 2003, 1-10, 2003 | 112 | 2003 |
Public speech-oriented guidance system with adult and child discrimination capability R Nisimura, A Lee, H Saruwatari, K Shikano 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 106 | 2004 |
IL-9 receptor signaling in memory B cells regulates humoral recall responses S Takatsuka, H Yamada, K Haniuda, H Saruwatari, M Ichihashi, ... Nature immunology 19 (9), 1025-1034, 2018 | 99 | 2018 |
Noice robust real world spoken dialogue system using GMM based rejection of unintended inputs A Lee, K Nakamura, R Nisimura, H Saruwatari, K Shikano | 96 | 2004 |
Automatic optimization scheme of spectral subtraction based on musical noise assessment via higher-order statistics Y Uemura, Y Takahashi, H Saruwatari, K Shikano, K Kondo | 95 | 2008 |
Musical-noise-free speech enhancement based on optimized iterative spectral subtraction R Miyazaki, H Saruwatari, T Inoue, Y Takahashi, K Shikano, K Kondo IEEE Transactions on Audio, Speech, and Language Processing 20 (7), 2080-2094, 2012 | 89 | 2012 |