Ryuichi Yamamoto 个人学术档案

引用次数

	总计	2019 年至今
引用	2571	2478
h 指数	15	14
i10 指数	20	17

660

330

165

495

20152016201720182019202020212022202320247 16 33 25 63 288 615 605 650 234

开放获取的出版物数量

查看全部

1 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Eunwoo SongVoice, Naver Cloud在 navercorp.com 的电子邮件经过验证
Tomoki HayashiHuman Dataware Lab. Co., Ltd., Nagoya University在 g.sp.m.is.nagoya-u.ac.jp 的电子邮件经过验证
Shinji WatanabeCarnegie Mellon University在 cmu.edu 的电子邮件经过验证
Takenori YoshimuraNagoya Institute of Technology在 nitech.ac.jp 的电子邮件经过验证
Min-Jae HwangMeta AI在 meta.com 的电子邮件经过验证
Tomoki TodaNagoya University在 icts.nagoya-u.ac.jp 的电子邮件经过验证
Shigeki KaritaGoogle在 google.com 的电子邮件经过验证
Hirofumi InagumaFundamental AI Research (FAIR) at Meta在 meta.com 的电子邮件经过验证
Brian McFeeMusic and Performing Arts Professions / Center for Data Science, New York University在 nyu.edu 的电子邮件经过验证
Jiatong Shi (史嘉彤)Carnegie Mellon University在 andrew.cmu.edu 的电子邮件经过验证
Takaaki SaekiGoogle在 google.com 的电子邮件经过验证

关注

Ryuichi Yamamoto

LY Corporation

在 lycorp.co.jp 的电子邮件经过验证 - 首页

Speech Synthesis Voice Conversion Speech Recognition Machine Learning Singing Voice Synthesis


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram R Yamamoto, E Song, JM Kim ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	829	2020
A comparative study on transformer vs rnn in speech applications S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	785	2019
librosa/librosa: 0.6. 3 B McFee, M McVicar, S Balke, C Thomé, C Raffel, D Lee, O Nieto, ... URL: https://doi. org/10.5281/zenodo 2564164, 2019	347*	2019
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ... ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020	219	2020
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation R Yamamoto, E Song, JM Kim arXiv preprint arXiv:1904.04472, 2019	55	2019
Espnet2-tts: Extending the edge of tts research T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ... arXiv preprint arXiv:2110.07840, 2021	46	2021
TTS-by-TTS: TTS-driven data augmentation for fast and high-quality speech synthesis MJ Hwang, R Yamamoto, E Song, JM Kim ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	37	2021
Improved Parallel WaveGAN vocoder with perceptually weighted spectrogram loss E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim 2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021	21	2021
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators R Yamamoto, E Song, MJ Hwang, JM Kim ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	20	2021
Semi-supervised speaker adaptation for end-to-end speech synthesis with pretrained models K Inoue, S Hara, M Abe, T Hayashi, R Yamamoto, S Watanabe ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	19	2020
Cross-speaker emotion transfer for low-resource text-to-speech using non-parallel voice conversion with pitch-shift data augmentation R Terashima, R Yamamoto, E Song, Y Shirahata, HW Yoon, JM Kim, ... arXiv preprint arXiv:2204.10020, 2022	16	2022
Ryry: A real-time score-following automatic accompaniment playback system capable of real performances with errors, repeats and jumps S Sako, R Yamamoto, T Kitamura Active Media Technology: 10th International Conference, AMT 2014, Warsaw …, 2014	16	2014
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model. MJ Hwang, R Yamamoto, E Song, JM Kim Interspeech, 2227-2231, 2021	15	2021
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis K Futamata, B Park, R Yamamoto, K Tachibana arXiv preprint arXiv:2104.12395, 2021	15	2021
Improving lpcnet-based text-to-speech with linear prediction-structured mixture density network MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	15	2020
Language model-based emotion prediction methods for emotional speech synthesis systems HW Yoon, O Kwon, H Lee, R Yamamoto, E Song, JM Kim, MJ Hwang arXiv preprint arXiv:2206.15067, 2022	14	2022
Score following handling performances with arbitrary repeats and skips and automatic accompaniment E Nakamura, H Takeda, R Yamamoto, Y Saito, S Sako, S Sagayama IPSJ Journal 54 (4), 1338-1349, 2013	14	2013
Neural text-to-speech with a modeling-by-generation excitation vocoder E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim arXiv preprint arXiv:2008.00132, 2020	11	2020
Wavenet vocoder R Yamamoto Wavenet vocoder, 2018	10	2018
Robust on-line algorithm for real-time audio-to-score alignment based on a delayed decision and anticipation framework R Yamamoto, S Sako, T Kitamura 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	10	2013

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用