Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion
Research on deep learning-powered voice conversion (VC) in speech-to-speech scenarios
is getting increasingly popular. Although many of the works in the field of voice conversion …
is getting increasingly popular. Although many of the works in the field of voice conversion …
Zero-shot voice conditioning for denoising diffusion tts models
We present a novel way of conditioning a pretrained denoising diffusion speech model to
produce speech in the voice of a novel person unseen during training. The method requires …
produce speech in the voice of a novel person unseen during training. The method requires …
Voice conversion can improve asr in very low-resource settings
Voice conversion (VC) could be used to improve speech recognition systems in low-
resource languages by using it to augment limited training data. However, VC has not been …
resource languages by using it to augment limited training data. However, VC has not been …
One-Shot Voice Conversion Based on Style Generative Adversarial Networks with ESR and DSNet
Y Li, L Pan, X Qiu, Z Yang, Z Tan, B Qian - Circuits, Systems, and Signal …, 2024 - Springer
This paper proposes a novel one-shot voice conversion (VC) method called DS-ESR-
StyleGAN-VC, which encompasses several innovations to address the challenges faced by …
StyleGAN-VC, which encompasses several innovations to address the challenges faced by …
CBFMCycleGAN-VC: Using the Improved MaskCycleGAN-VC to Effectively Predict a Person's Voice After Aging
X Zhou, L Yu, F Niu, J Jin - IEEE Access, 2022 - ieeexplore.ieee.org
One task of nonparallel speech conversion is to convert the source speaker's speech
samples to the target speaker's speech samples, keeping the content unchanged. In view of …
samples to the target speaker's speech samples, keeping the content unchanged. In view of …
Disentanglement Learning for Text-Free Voice Conversion
M Chen - 2023 - etheses.whiterose.ac.uk
Voice conversion (VC) aims to change the perceived speaker identity of a speech signal
from one to another, while preserving the linguistic content. Recent state-of-the-art VC …
from one to another, while preserving the linguistic content. Recent state-of-the-art VC …
[PDF][PDF] Comparison and Development of Practical Voice Conversion Models
M Baas - 2020 - rf5.github.io
Voice conversion (VC) is a speech processing task where speech by a source speaker is
transformed into speech that appears to be spoken by a desired target speaker while …
transformed into speech that appears to be spoken by a desired target speaker while …