An overview of voice conversion systems
SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
Transfer learning from speech synthesis to voice conversion with non-parallel training data
We present a novel voice conversion (VC) framework by learning from a text-to-speech
(TTS) synthesis system, that is called TTS-VC transfer learning or TTL-VC for short. We first …
(TTS) synthesis system, that is called TTS-VC transfer learning or TTL-VC for short. We first …
Cross-lingual voice conversion with bilingual phonetic posteriorgram and average modeling
This paper presents a cross-lingual voice conversion approach using bilingual Phonetic
PosteriorGram (PPG) and average modeling. The proposed approach makes use of …
PosteriorGram (PPG) and average modeling. The proposed approach makes use of …
[PDF][PDF] Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance.
Developing a voice conversion (VC) system for a particular speaker typically requires
considerable data from both the source and target speakers. This paper aims to effectuate …
considerable data from both the source and target speakers. This paper aims to effectuate …
[PDF][PDF] Average Modeling Approach to Voice Conversion with Non-Parallel Data.
Voice conversion techniques typically require source-target parallel speech data for model
training. Such parallel data may not be available always in practice. This paper presents a …
training. Such parallel data may not be available always in practice. This paper presents a …
An exemplar-based approach to frequency warping for voice conversion
The voice conversion's task is to modify a source speaker's voice to sound like that of a
target speaker. A conversion method is considered successful when the produced speech …
target speaker. A conversion method is considered successful when the produced speech …
Transformation of prosody in voice conversion
Voice Conversion (VC) aims to convert one's voice to sound like that of another. So far, most
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …
Optimizing voice conversion network with cycle consistency loss of speaker identity
We propose a novel training scheme to optimize voice conversion network with a speaker
identity loss function. The training scheme not only minimizes frame-level spectral loss, but …
identity loss function. The training scheme not only minimizes frame-level spectral loss, but …
[PDF][PDF] A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data.
In a typical voice conversion system, vocoder is commonly used for speech-to-features
analysis and features-to-speech synthesis. However, vocoder can be a source of speech …
analysis and features-to-speech synthesis. However, vocoder can be a source of speech …
Realistic transformation of facial and vocal smiles in real-time audiovisual streams
Research in affective computing and cognitive science has shown the importance of
emotional facial and vocal expressions during human-computer and human-human …
emotional facial and vocal expressions during human-computer and human-human …