Eigenvoice conversion based on Gaussian mixture model
This paper describes a novel framework of voice conversion (VC). We call it eigenvoice
conversion (EVC). We apply EVC to the conversion from a source speaker's voice to
arbitrary target speakers' voices. Using multiple parallel data sets consisting of utterance-
pairs of the source and multiple pre-stored target speakers, a canonical eigenvoice GMM
(EV-GMM) is trained in advance. That conversion model enables us to flexibly control the
speaker individuality of the converted speech by manually setting weight parameters. In …
conversion (EVC). We apply EVC to the conversion from a source speaker's voice to
arbitrary target speakers' voices. Using multiple parallel data sets consisting of utterance-
pairs of the source and multiple pre-stored target speakers, a canonical eigenvoice GMM
(EV-GMM) is trained in advance. That conversion model enables us to flexibly control the
speaker individuality of the converted speech by manually setting weight parameters. In …
This paper describes a novel framework of voice conversion (VC). We call it eigenvoice conversion (EVC). We apply EVC to the conversion from a source speaker's voice to arbitrary target speakers' voices. Using multiple parallel data sets consisting of utterance-pairs of the source and multiple pre-stored target speakers, a canonical eigenvoice GMM (EV-GMM) is trained in advance. That conversion model enables us to flexibly control the speaker individuality of the converted speech by manually setting weight parameters. In addition, the optimum weight set for a specific target speaker is estimated using only speech data of the target speaker without any linguistic restrictions. We evaluate the performance of EVC by a spectral distortion measure. Experimental results demonstrate that EVC works very well even if we use only a few utterances of the target speaker for the weight estimation.
naist.repo.nii.ac.jp
以上显示的是最相近的搜索结果。 查看全部搜索结果