An overview of voice conversion and its challenges: From statistical modeling to deep learning

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

An overview of voice conversion systems

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory

T Toda, AW Black, K Tokuda - IEEE Transactions on Audio …, 2007 - ieeexplore.ieee.org
In this paper, we describe a novel spectral conversion method for voice conversion (VC). A
Gaussian mixture model (GMM) of the joint probability density of source and target features …

Voice transformation: a survey

Y Stylianou - 2009 IEEE International Conference on Acoustics …, 2009 - ieeexplore.ieee.org
Voice transformation refers to the various modifications one may apply to the sound
produced by a person, speaking or singing. Voice transformation is usually seen as an add …

Voice conversion based on weighted frequency warping

D Erro, A Moreno, A Bonafonte - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org
Any modification applied to speech signals has an impact on their perceptual quality. In
particular, voice conversion to modify a source voice so that it is perceived as a specific …

INCA algorithm for training voice conversion systems from nonparallel corpora

D Erro, A Moreno, A Bonafonte - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org
Most existing voice conversion systems, particularly those based on Gaussian mixture
models, require a set of paired acoustic vectors from the source and target speakers to learn …

Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora

E Godoy, O Rosec, T Chonavel - IEEE Transactions on Audio …, 2011 - ieeexplore.ieee.org
In Voice Conversion (VC), the speech of a source speaker is modified to resemble that of a
particular target speaker. Currently, standard VC approaches use Gaussian mixture model …

Analysis of the Voice Conversion Challenge 2016 Evaluation Results.

M Wester, Z Wu, J Yamagishi - Interspeech, 2016 - isca-archive.org
The Voice Conversion Challenge 2016 is the first Voice Conversion Challenge in
which different voice conversion systems and approaches using the same voice data were …

Voco: Text-based insertion and replacement in audio narration

Z Jin, GJ Mysore, S Diverdi, J Lu… - ACM Transactions on …, 2017 - dl.acm.org
Editing audio narration using conventional software typically involves many painstaking
low-level manipulations. Some state of the art systems allow the editor to work in a text transcript …