Eigenvoice conversion based on Gaussian mixture model- 学术资源搜索

Eigenvoice conversion based on Gaussian mixture model

T Toda, Y Ohtani, K Shikano - 2006 - naist.repo.nii.ac.jp

2006•naist.repo.nii.ac.jp

This paper describes a novel framework of voice conversion (VC). We call it eigenvoice conversion (EVC). We apply EVC to the conversion from a source speaker's voice to arbitrary target speakers' voices. Using multiple parallel data sets consisting of utterance-pairs of the source and multiple pre-stored target speakers, a canonical eigenvoice GMM (EV-GMM) is trained in advance. That conversion model enables us to flexibly control the speaker individuality of the converted speech by manually setting weight parameters. In addition, the optimum weight set for a specific target speaker is estimated using only speech data of the target speaker without any linguistic restrictions. We evaluate the performance of EVC by a spectral distortion measure. Experimental results demonstrate that EVC works very well even if we use only a few utterances of the target speaker for the weight estimation.

naist.repo.nii.ac.jp

展开收起

被引用次数：152 相关文章所有 12 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果