An overview of voice conversion and its challenges: From statistical modeling to deep learning

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …

Synthesizing dysarthric speech using multi-speaker TTS for dysarthric speech recognition

M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

An exemplar-based approach to frequency warping for voice conversion

X Tian, SW Lee, Z Wu, ES Chng… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org
The voice conversion's task is to modify a source speaker's voice to sound like that of a
target speaker. A conversion method is considered successful when the produced speech …

Transformation of prosody in voice conversion

B Şişman, H Li, KC Tan - 2017 Asia-Pacific Signal and …, 2017 - ieeexplore.ieee.org
Voice Conversion (VC) aims to convert one's voice to sound like that of another. So far, most
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …

High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder

K Chen, B Chen, J Lai, K Yu - Interspeech, 2018 - isca-archive.org
Waveform generator is a key component in voice conversion. Recently, WaveNet waveform
generator conditioned on the Mel-cepstrum (Mcep) has shown better quality over standard …

Noise-robust voice conversion using high-quefrency boosting via sub-band cepstrum conversion and fusion

X Miao, M Sun, X Zhang, Y Wang - Applied Sciences, 2019 - mdpi.com
Featured Application: In this paper, we proposed a method of noise-robust voice conversion
using high-quefrency boosting via sub-band cepstrum conversion and fusion. This method …

Application of voice recognition interaction and big data internet of things in urban fire fighting

X Sun, K Cai, B Chen, J Zha, G Zhou - Journal of Location Based …, 2024 - Taylor & Francis
With the continuous development of science and technology, especially computer
technology, people need a more convenient and natural way to communicate with the …

Any-to-One Non-Parallel Voice Conversion System Using an Autoregressive Conversion Model and LPCNet Vocoder

K Ezzine, J Di Martino, M Frikha - Applied Sciences, 2023 - mdpi.com
We present an any-to-one voice conversion (VC) system, using an autoregressive model
and LPCNet vocoder, aimed at enhancing the converted speech in terms of naturalness …

Prosodic transformation in vocal emotion conversion for multi-lingual scenarios: A pilot study

S Vekkot, D Gupta - International Journal of Speech Technology, 2019 - Springer
The primary objective of this work is to compare patterns for vocal expression across distinct
linguistic contexts. Five language (datasets) are taken for experimentation viz. German …

Fast many-to-one voice conversion using autoencoders

Y Sekii, R Orihara, K Kojima, Y Sei… - … Conference on Agents …, 2017 - scitepress.org
Most of voice conversion (VC) methods were dealing with a one-to-one VC issue and there
were few studies that tackled many-to-one/many-to-many cases. It is difficult to prepare the …