Sparse representation for frequency warping based voice conversion

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier

Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

被引用次数：337 相关文章所有 6 个版本

[PDF] isca-archive.org

[PDF][PDF] Average Modeling Approach to Voice Conversion with Non-Parallel Data.

X Tian, J Wang, H Xu, ES Chng, H Li - Odyssey, 2018 - isca-archive.org

Voice conversion techniques typically require source-target parallel speech data for model
training. Such parallel data may not be available always in practice. This paper presents a …

被引用次数：56 相关文章所有 6 个版本

[PDF] ntu.edu.sg

An exemplar-based approach to frequency warping for voice conversion

X Tian, SW Lee, Z Wu, ES Chng… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org

The voice conversion's task is to modify a source speaker's voice to sound like that of a
target speaker. A conversion method is considered successful when the produced speech …

被引用次数：50 相关文章所有 4 个版本

[PDF] arxiv.org

Noise-robust voice conversion with domain adversarial training

H Du, L Xie, H Li - Neural Networks, 2022 - Elsevier

Voice conversion has made great progress in the past few years under the studio-quality test
scenario in terms of speech quality and speaker similarity. However, in real applications, test …

被引用次数：10 相关文章所有 6 个版本

[PDF] arxiv.org

Optimizing voice conversion network with cycle consistency loss of speaker identity

H Du, X Tian, L Xie, H Li - 2021 IEEE Spoken language …, 2021 - ieeexplore.ieee.org

We propose a novel training scheme to optimize voice conversion network with a speaker
identity loss function. The training scheme not only minimizes frame-level spectral loss, but …

被引用次数：19 相关文章所有 4 个版本

[PDF] isca-archive.org

[PDF][PDF] A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data.

X Tian, ES Chng, H Li - Interspeech, 2019 - isca-archive.org

In a typical voice conversion system, vocoder is commonly used for speech-to-features
analysis and features-to-speech synthesis. However, vocoder can be a source of speech …

被引用次数：27 相关文章所有 5 个版本

[PDF] arxiv.org

High quality voice conversion using prosodic and high-resolution spectral features

HQ Nguyen, SW Lee, X Tian, M Dong… - Multimedia Tools and …, 2016 - Springer

Voice conversion methods have advanced rapidly over the last decade. Studies have shown
that speaker characteristics are captured by spectral feature as well as various prosodic …

被引用次数：30 相关文章所有 12 个版本

[PDF] mdpi.com

Noise-robust voice conversion using high-quefrency boosting via sub-band cepstrum conversion and fusion

X Miao, M Sun, X Zhang, Y Wang - Applied Sciences, 2019 - mdpi.com

Featured Application In this paper, we proposed a method of noise-robust voice conversion
using high-quefrency boosting via sub-band cepstrum conversion and fusion. This method …

被引用次数：14 相关文章所有 5 个版本

[PDF] academia.edu

Speaker-independent spectral mapping for speech-to-singing conversion

X Gao, X Tian, RK Das, Y Zhou… - 2019 Asia-Pacific Signal …, 2019 - ieeexplore.ieee.org

Speech-to-Singing (STS) conversion aims at converting one's reading speech into his/her
singing vocal. The prior work was mainly focused on transforming the prosody of speech to …

被引用次数：14 相关文章所有 4 个版本

[PDF] mdpi.com

Any-to-One Non-Parallel Voice Conversion System Using an Autoregressive Conversion Model and LPCNet Vocoder

K Ezzine, J Di Martino, M Frikha - Applied Sciences, 2023 - mdpi.com

We present an any-to-one voice conversion (VC) system, using an autoregressive model
and LPCNet vocoder, aimed at enhancing the converted speech in terms of naturalness …

被引用次数：1 相关文章所有 10 个版本