An overview of voice conversion and its challenges: From statistical modeling to deep learning
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …
conversion, we change the speaker identity from one to another, while keeping the linguistic …
Synthesizing dysarthric speech using multi-speaker tts for dysarthric speech recognition
M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …
through slow, uncoordinated control of speech production muscles. Automatic Speech …
An exemplar-based approach to frequency warping for voice conversion
The voice conversion's task is to modify a source speaker's voice to sound like that of a
target speaker. A conversion method is considered successful when the produced speech …
target speaker. A conversion method is considered successful when the produced speech …
Transformation of prosody in voice conversion
Voice Conversion (VC) aims to convert one's voice to sound like that of another. So far, most
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …
[PDF][PDF] High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder.
Waveform generator is a key component in voice conversion. Recently, WaveNet waveform
generator conditioned on the Mel-cepstrum (Mcep) has shown better quality over standard …
generator conditioned on the Mel-cepstrum (Mcep) has shown better quality over standard …
[HTML][HTML] Noise-robust voice conversion using high-quefrency boosting via sub-band cepstrum conversion and fusion
X Miao, M Sun, X Zhang, Y Wang - Applied Sciences, 2019 - mdpi.com
Featured Application In this paper, we proposed a method of noise-robust voice conversion
using high-quefrency boosting via sub-band cepstrum conversion and fusion. This method …
using high-quefrency boosting via sub-band cepstrum conversion and fusion. This method …
Application of voice recognition interaction and big data internet of things in urban fire fighting
X Sun, K Cai, B Chen, J Zha, G Zhou - Journal of Location Based …, 2024 - Taylor & Francis
With the continuous development of science and technology, especially computer
technology, people need a more convenient and natural way to communicate with the …
technology, people need a more convenient and natural way to communicate with the …
[HTML][HTML] Any-to-One Non-Parallel Voice Conversion System Using an Autoregressive Conversion Model and LPCNet Vocoder
We present an any-to-one voice conversion (VC) system, using an autoregressive model
and LPCNet vocoder, aimed at enhancing the converted speech in terms of naturalness …
and LPCNet vocoder, aimed at enhancing the converted speech in terms of naturalness …
Prosodic transformation in vocal emotion conversion for multi-lingual scenarios: A pilot study
The primary objective of this work is to compare patterns for vocal expression across distinct
linguistic contexts. Five language (datasets) are taken for experimentation viz. German …
linguistic contexts. Five language (datasets) are taken for experimentation viz. German …
Fast many-to-one voice conversion using autoencoders
Most of voice conversion (VC) methods were dealing with a one-to-one VC issue and there
were few studies that tackled many-to-one/many-to-many cases. It is difficult to prepare the …
were few studies that tackled many-to-one/many-to-many cases. It is difficult to prepare the …