High quality voice conversion using prosodic and high-resolution spectral features

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org

Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …

被引用次数：371 相关文章所有 8 个版本

[PDF] uky.edu

Synthesizing dysarthric speech using multi-speaker tts for dysarthric speech recognition

M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

被引用次数：25 相关文章所有 4 个版本

[PDF] ntu.edu.sg

An exemplar-based approach to frequency warping for voice conversion

X Tian, SW Lee, Z Wu, ES Chng… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org

The voice conversion's task is to modify a source speaker's voice to sound like that of a
target speaker. A conversion method is considered successful when the produced speech …

被引用次数：50 相关文章所有 4 个版本

[PDF] apsipa.org

Transformation of prosody in voice conversion

B Şişman, H Li, KC Tan - 2017 Asia-Pacific Signal and …, 2017 - ieeexplore.ieee.org

Voice Conversion (VC) aims to convert one's voice to sound like that of another. So far, most
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …

被引用次数：38 相关文章所有 5 个版本

[PDF] isca-archive.org

[PDF][PDF] High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder.

K Chen, B Chen, J Lai, K Yu - Interspeech, 2018 - isca-archive.org

Waveform generator is a key component in voice conversion. Recently, WaveNet waveform
generator conditioned on the Mel-cepstrum (Mcep) has shown better quality over standard …

被引用次数：28 相关文章所有 3 个版本

[HTML] mdpi.com

[HTML][HTML] Noise-robust voice conversion using high-quefrency boosting via sub-band cepstrum conversion and fusion

X Miao, M Sun, X Zhang, Y Wang - Applied Sciences, 2019 - mdpi.com

Featured Application In this paper, we proposed a method of noise-robust voice conversion
using high-quefrency boosting via sub-band cepstrum conversion and fusion. This method …

被引用次数：14 相关文章所有 5 个版本

Application of voice recognition interaction and big data internet of things in urban fire fighting

X Sun, K Cai, B Chen, J Zha, G Zhou - Journal of Location Based …, 2024 - Taylor & Francis

With the continuous development of science and technology, especially computer
technology, people need a more convenient and natural way to communicate with the …

被引用次数：6 相关文章

[HTML] mdpi.com

[HTML][HTML] Any-to-One Non-Parallel Voice Conversion System Using an Autoregressive Conversion Model and LPCNet Vocoder

K Ezzine, J Di Martino, M Frikha - Applied Sciences, 2023 - mdpi.com

We present an any-to-one voice conversion (VC) system, using an autoregressive model
and LPCNet vocoder, aimed at enhancing the converted speech in terms of naturalness …

被引用次数：1 相关文章所有 10 个版本

Prosodic transformation in vocal emotion conversion for multi-lingual scenarios: A pilot study

S Vekkot, D Gupta - International Journal of Speech Technology, 2019 - Springer

The primary objective of this work is to compare patterns for vocal expression across distinct
linguistic contexts. Five language (datasets) are taken for experimentation viz. German …

被引用次数：9 相关文章所有 2 个版本

[PDF] scitepress.org

Fast many-to-one voice conversion using autoencoders

Y Sekii, R Orihara, K Kojima, Y Sei… - … Conference on Agents …, 2017 - scitepress.org

Most of voice conversion (VC) methods were dealing with a one-to-one VC issue and there
were few studies that tackled many-to-one/many-to-many cases. It is difficult to prepare the …

被引用次数：9 相关文章所有 5 个版本