Statistical parametric speech synthesis incorporating generative adversarial networks Y Saito, S Takamichi, H Saruwatari IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (1), 84-96, 2018 | 264 | 2018 |
JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis R Sonobe, S Takamichi, H Saruwatari arXiv preprint arXiv:1711.00354, 2017 | 146 | 2017 |
Non-parallel voice conversion using variational autoencoders conditioned by phonetic posteriorgrams and d-vectors Y Saito, Y Ijima, K Nishida, S Takamichi 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 142 | 2018 |
Post-Filters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis S Takamichi, T Toda, A Black, S Sakti, G Neubig, S Nakamura IEEE, 2016 | 102 | 2016 |
A postfilter to modify the modulation spectrum in HMM-based speech synthesis S Takamichi, T Toda, G Neubig, S Sakti, S Nakamura 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 87 | 2014 |
Independent deeply learned matrix analysis for determined audio source separation N Makishima, S Mogami, N Takamune, D Kitamura, H Sumino, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (10 …, 2019 | 75 | 2019 |
UTMOS: UTokyo-SaruLab system for VoiceMOS Challenge 2022 T Saeki, D Xin, W Nakata, T Koriyama, S Takamichi, H Saruwatari arXiv preprint arXiv:2204.02152, 2022 | 69 | 2022 |
Voice conversion using sequence-to-sequence learning of context posterior probabilities H Miyoshi, Y Saito, S Takamichi, H Saruwatari arXiv preprint arXiv:1704.02360, 2017 | 67 | 2017 |
JVS corpus: free Japanese multi-speaker voice corpus S Takamichi, K Mitsui, Y Saito, T Koriyama, N Tanji, H Saruwatari arXiv preprint arXiv:1908.06248, 2019 | 65 | 2019 |
The NAIST Text-to-Speech System for the Blizzard Challenge 2015 S Takamichi, K Kobayashi, K Tanaka, T Toda, S Nakamura Blizzard Challenge Workshop, 2015 | 59 | 2015 |
JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research S Takamichi, R Sonobe, K Mitsui, Y Saito, T Koriyama, N Tanji, ... Acoustical Science and Technology 41 (5), 761-768, 2020 | 56 | 2020 |
Phase reconstruction from amplitude spectrograms based on von-Mises-distribution deep neural network S Takamichi, Y Saito, N Takamune, D Kitamura, H Saruwatari 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC …, 2018 | 47 | 2018 |
ESPnet2-TTS: Extending the edge of TTS research T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ... arXiv preprint arXiv:2110.07840, 2021 | 46 | 2021 |
Independent deeply learned matrix analysis for multichannel audio source separation S Mogami, H Sumino, D Kitamura, N Takamune, S Takamichi, ... 2018 26th European Signal Processing Conference (EUSIPCO), 1557-1561, 2018 | 45 | 2018 |
Modulation spectrum-constrained trajectory training algorithm for GMM-based voice conversion S Takamichi, T Toda, AW Black, S Nakamura 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 38 | 2015 |
Training algorithm to deceive anti-spoofing verification for DNN-based speech synthesis Y Saito, S Takamichi, H Saruwatari Proc. ICASSP, 2017 | 37 | 2017 |
The NU-NAIST Voice Conversion System for the Voice Conversion Challenge 2016 K Kobayashi, S Takamichi, S Nakamura, T Toda Interspeech, 1667-1671, 2016 | 35 | 2016 |
Voice conversion using input-to-output highway networks Y Saito, S Takamichi, H Saruwatari IEICE Transactions on Information and Systems 100 (8), 1925-1928, 2017 | 32 | 2017 |
Foley sound synthesis at the DCASE 2023 challenge K Choi, J Im, L Heller, B McFee, K Imoto, Y Okamoto, M Lagrange, ... arXiv preprint arXiv:2304.12521, 2023 | 30 | 2023 |
Disentangled speaker and language representations using mutual information minimization and domain adaptation for cross-lingual TTS D Xin, T Komatsu, S Takamichi, H Saruwatari ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 29 | 2021 |