An overview of voice conversion systems

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

Transfer learning from speech synthesis to voice conversion with non-parallel training data

M Zhang, Y Zhou, L Zhao, H Li - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
We present a novel voice conversion (VC) framework by learning from a text-to-speech
(TTS) synthesis system, that is called TTS-VC transfer learning or TTL-VC for short. We first …

Cross-lingual voice conversion with bilingual phonetic posteriorgram and average modeling

Y Zhou, X Tian, H Xu, RK Das… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
This paper presents a cross-lingual voice conversion approach using bilingual Phonetic
PosteriorGram (PPG) and average modeling. The proposed approach makes use of …

[PDF][PDF] Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance.

S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng - Interspeech, 2018 - se.cuhk.edu.hk
Developing a voice conversion (VC) system for a particular speaker typically requires
considerable data from both the source and target speakers. This paper aims to effectuate …

[PDF][PDF] Average Modeling Approach to Voice Conversion with Non-Parallel Data.

X Tian, J Wang, H Xu, ES Chng, H Li - Odyssey, 2018 - isca-archive.org
Voice conversion techniques typically require source-target parallel speech data for model
training. Such parallel data may not be available always in practice. This paper presents a …

An exemplar-based approach to frequency warping for voice conversion

X Tian, SW Lee, Z Wu, ES Chng… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org
The voice conversion's task is to modify a source speaker's voice to sound like that of a
target speaker. A conversion method is considered successful when the produced speech …

Transformation of prosody in voice conversion

B Şişman, H Li, KC Tan - 2017 Asia-Pacific Signal and …, 2017 - ieeexplore.ieee.org
Voice Conversion (VC) aims to convert one's voice to sound like that of another. So far, most
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …

Optimizing voice conversion network with cycle consistency loss of speaker identity

H Du, X Tian, L Xie, H Li - 2021 IEEE Spoken language …, 2021 - ieeexplore.ieee.org
We propose a novel training scheme to optimize voice conversion network with a speaker
identity loss function. The training scheme not only minimizes frame-level spectral loss, but …

[PDF][PDF] A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data.

X Tian, ES Chng, H Li - Interspeech, 2019 - isca-archive.org
In a typical voice conversion system, vocoder is commonly used for speech-to-features
analysis and features-to-speech synthesis. However, vocoder can be a source of speech …

Realistic transformation of facial and vocal smiles in real-time audiovisual streams

P Arias, C Soladie, O Bouafif, A Roebel… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
Research in affective computing and cognitive science has shown the importance of
emotional facial and vocal expressions during human-computer and human-human …