An overview of voice conversion systems

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

[PDF][PDF] Average Modeling Approach to Voice Conversion with Non-Parallel Data.

X Tian, J Wang, H Xu, ES Chng, H Li - Odyssey, 2018 - isca-archive.org
Voice conversion techniques typically require source-target parallel speech data for model
training. Such parallel data may not be available always in practice. This paper presents a …

An exemplar-based approach to frequency warping for voice conversion

X Tian, SW Lee, Z Wu, ES Chng… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org
The voice conversion's task is to modify a source speaker's voice to sound like that of a
target speaker. A conversion method is considered successful when the produced speech …

Noise-robust voice conversion with domain adversarial training

H Du, L Xie, H Li - Neural Networks, 2022 - Elsevier
Voice conversion has made great progress in the past few years under the studio-quality test
scenario in terms of speech quality and speaker similarity. However, in real applications, test …

Optimizing voice conversion network with cycle consistency loss of speaker identity

H Du, X Tian, L Xie, H Li - 2021 IEEE Spoken language …, 2021 - ieeexplore.ieee.org
We propose a novel training scheme to optimize voice conversion network with a speaker
identity loss function. The training scheme not only minimizes frame-level spectral loss, but …

[PDF][PDF] A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data.

X Tian, ES Chng, H Li - Interspeech, 2019 - isca-archive.org
In a typical voice conversion system, vocoder is commonly used for speech-to-features
analysis and features-to-speech synthesis. However, vocoder can be a source of speech …

High quality voice conversion using prosodic and high-resolution spectral features

HQ Nguyen, SW Lee, X Tian, M Dong… - Multimedia Tools and …, 2016 - Springer
Voice conversion methods have advanced rapidly over the last decade. Studies have shown
that speaker characteristics are captured by spectral feature as well as various prosodic …

Noise-robust voice conversion using high-quefrency boosting via sub-band cepstrum conversion and fusion

X Miao, M Sun, X Zhang, Y Wang - Applied Sciences, 2019 - mdpi.com
Featured Application In this paper, we proposed a method of noise-robust voice conversion
using high-quefrency boosting via sub-band cepstrum conversion and fusion. This method …

Speaker-independent spectral mapping for speech-to-singing conversion

X Gao, X Tian, RK Das, Y Zhou… - 2019 Asia-Pacific Signal …, 2019 - ieeexplore.ieee.org
Speech-to-Singing (STS) conversion aims at converting one's reading speech into his/her
singing vocal. The prior work was mainly focused on transforming the prosody of speech to …

Any-to-One Non-Parallel Voice Conversion System Using an Autoregressive Conversion Model and LPCNet Vocoder

K Ezzine, J Di Martino, M Frikha - Applied Sciences, 2023 - mdpi.com
We present an any-to-one voice conversion (VC) system, using an autoregressive model
and LPCNet vocoder, aimed at enhancing the converted speech in terms of naturalness …