Correlation-based frequency warping for voice conversion

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier

Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

Cited by 337 · Related articles · All 6 versions

[PDF] ieee.org

Transfer learning from speech synthesis to voice conversion with non-parallel training data

M Zhang, Y Zhou, L Zhao, H Li - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org

We present a novel voice conversion (VC) framework by learning from a text-to-speech
(TTS) synthesis system, that is called TTS-VC transfer learning or TTL-VC for short. We first …

Cited by 59 · Related articles · All 5 versions

[PDF] researchgate.net

Cross-lingual voice conversion with bilingual phonetic posteriorgram and average modeling

Y Zhou, X Tian, H Xu, RK Das… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org

This paper presents a cross-lingual voice conversion approach using bilingual Phonetic
PosteriorGram (PPG) and average modeling. The proposed approach makes use of …

Cited by 91 · Related articles · All 5 versions

[PDF] cuhk.edu.hk

[PDF] Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance.

S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng - Interspeech, 2018 - se.cuhk.edu.hk

Developing a voice conversion (VC) system for a particular speaker typically requires
considerable data from both the source and target speakers. This paper aims to effectuate …

Cited by 67 · Related articles · All 6 versions

[PDF] isca-archive.org

[PDF] Average Modeling Approach to Voice Conversion with Non-Parallel Data.

X Tian, J Wang, H Xu, ES Chng, H Li - Odyssey, 2018 - isca-archive.org

Voice conversion techniques typically require source-target parallel speech data for model
training. Such parallel data may not be available always in practice. This paper presents a …

Cited by 56 · Related articles · All 6 versions

[PDF] ntu.edu.sg

An exemplar-based approach to frequency warping for voice conversion

X Tian, SW Lee, Z Wu, ES Chng… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org

The voice conversion's task is to modify a source speaker's voice to sound like that of a
target speaker. A conversion method is considered successful when the produced speech …

Cited by 50 · Related articles · All 4 versions

[PDF] apsipa.org

Transformation of prosody in voice conversion

B Şişman, H Li, KC Tan - 2017 Asia-Pacific Signal and …, 2017 - ieeexplore.ieee.org

Voice Conversion (VC) aims to convert one's voice to sound like that of another. So far, most
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …

Cited by 38 · Related articles · All 5 versions

[PDF] arxiv.org

Optimizing voice conversion network with cycle consistency loss of speaker identity

H Du, X Tian, L Xie, H Li - 2021 IEEE Spoken language …, 2021 - ieeexplore.ieee.org

We propose a novel training scheme to optimize voice conversion network with a speaker
identity loss function. The training scheme not only minimizes frame-level spectral loss, but …

Cited by 19 · Related articles · All 4 versions

[PDF] isca-archive.org

[PDF] A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data.

X Tian, ES Chng, H Li - Interspeech, 2019 - isca-archive.org

In a typical voice conversion system, vocoder is commonly used for speech-to-features
analysis and features-to-speech synthesis. However, vocoder can be a source of speech …

Cited by 27 · Related articles · All 5 versions

[PDF] ieee.org

Realistic transformation of facial and vocal smiles in real-time audiovisual streams

P Arias, C Soladie, O Bouafif, A Roebel… - IEEE Transactions …, 2018 - ieeexplore.ieee.org

Research in affective computing and cognitive science has shown the importance of
emotional facial and vocal expressions during human-computer and human-human …

Cited by 34 · Related articles · All 6 versions