An overview of voice conversion and its challenges: From statistical modeling to deep learning
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …
conversion, we change the speaker identity from one to another, while keeping the linguistic …
Spoofing and countermeasures for speaker verification: A survey
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …
the technology can be susceptible to malicious spoofing attacks. The research community …
An overview of voice conversion systems
SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
In this paper, we describe a novel spectral conversion method for voice conversion (VC). A
Gaussian mixture model (GMM) of the joint probability density of source and target features …
Gaussian mixture model (GMM) of the joint probability density of source and target features …
Voice transformation: a survey
Y Stylianou - 2009 IEEE International Conference on Acoustics …, 2009 - ieeexplore.ieee.org
Voice transformation refers to the various modifications one may apply to the sound
produced by a person, speaking or singing. Voice transformation is usually seen as an add …
produced by a person, speaking or singing. Voice transformation is usually seen as an add …
Voice conversion based on weighted frequency warping
Any modification applied to speech signals has an impact on their perceptual quality. In
particular, voice conversion to modify a source voice so that it is perceived as a specific …
particular, voice conversion to modify a source voice so that it is perceived as a specific …
INCA algorithm for training voice conversion systems from nonparallel corpora
Most existing voice conversion systems, particularly those based on Gaussian mixture
models, require a set of paired acoustic vectors from the source and target speakers to learn …
models, require a set of paired acoustic vectors from the source and target speakers to learn …
Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
E Godoy, O Rosec, T Chonavel - IEEE Transactions on Audio …, 2011 - ieeexplore.ieee.org
In Voice Conversion (VC), the speech of a source speaker is modified to resemble that of a
particular target speaker. Currently, standard VC approaches use Gaussian mixture model …
particular target speaker. Currently, standard VC approaches use Gaussian mixture model …
[PDF][PDF] Analysis of the Voice Conversion Challenge 2016 Evaluation Results.
Abstract The Voice Conversion Challenge 2016 is the first Voice Conversion Challenge in
which different voice conversion systems and approaches using the same voice data were …
which different voice conversion systems and approaches using the same voice data were …
Voco: Text-based insertion and replacement in audio narration
Editing audio narration using conventional software typically involves many painstaking low-
level manipulations. Some state of the art systems allow the editor to work in a text transcript …
level manipulations. Some state of the art systems allow the editor to work in a text transcript …