Modeling unvoiced sounds in statistical parametric speech synthesis with a continuous vocoder

In this paper we present our initial results in articulatory-to-acoustic conversion based on
tongue movement recordings using Deep Neural Networks (DNNs). Despite the fact that …

Cited by 72 - Related articles - All 12 versions

[PDF] arxiv.org

Ultrasound-based articulatory-to-acoustic mapping with WaveGlow speech synthesis

TG Csapó, C Zainkó, L Tóth, G Gosztolya… - arXiv preprint arXiv …, 2020 - arxiv.org

For articulatory-to-acoustic mapping using deep neural networks, typically spectral and
excitation parameters of vocoders have been used as the training targets. However …

Cited by 27 - Related articles - All 12 versions

[PDF] springer.com

Investigations on speaker adaptation using a continuous vocoder within recurrent neural network based text-to-speech synthesis

AR Mandeel, MS Al-Radhi, TG Csapó - Multimedia Tools and Applications, 2023 - Springer

This paper presents an investigation of speaker adaptation using a continuous vocoder for
parametric text-to-speech (TTS) synthesis. In purposes that demand low computational …

Cited by 8 - Related articles - All 5 versions

[PDF] arxiv.org

Ultrasound-based silent speech interface built on a continuous vocoder

TG Csapó, MS Al-Radhi, G Németh… - arXiv preprint arXiv …, 2019 - arxiv.org

Recently it was shown that within the Silent Speech Interface (SSI) field, the prediction of F0
is possible from Ultrasound Tongue Images (UTI) as the articulatory input, using Deep …

Cited by 21 - Related articles - All 11 versions

[PDF] isca-archive.org

[PDF] Time-Domain Envelope Modulating the Noise Component of Excitation in a Continuous Residual-Based Vocoder for Statistical Parametric Speech Synthesis.

MS Al-Radhi, TG Csapó, G Németh - Interspeech, 2017 - isca-archive.org

In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis. Previous work has shown the advantages of adding …

Cited by 26 - Related articles - All 9 versions

[PDF] mtak.hu

Towards implementing a software tester for benchmarking MAP-T devices

A Al-hamadani, G Lencse - Infocommunications Journal, 2022 - real.mtak.hu

Several IPv6 transition technologies have been designed and developed over the past few
years to accelerate the full adoption of the IPv6 address pool. To make things more …

Cited by 5 - Related articles - All 7 versions

[PDF] mtak.hu

Speaker adaptation experiments with limited data for end-to-end text-to-speech synthesis using Tacotron2

AR Mandeel, MS Al-Radhi, TG Csapó - Infocommunications Journal, 2022 - real.mtak.hu

Speech synthesis has the aim of generating humanlike speech from text. Nowadays, with
end-to-end systems, highly natural synthesized speech can be achieved if a large enough …

Cited by 5 - Related articles - All 6 versions

[PDF] arxiv.org

Speaker Adaptation with Continuous Vocoder-Based DNN-TTS

AR Mandeel, MS Al-Radhi, TG Csapó - Speech and Computer: 23rd …, 2021 - Springer

Traditional vocoder-based statistical parametric speech synthesis can be advantageous in
applications that require low computational complexity. Recent neural vocoders, which can …

Cited by 7 - Related articles - All 4 versions

A continuous vocoder for statistical parametric speech synthesis and its evaluation using an audio-visual phonetically annotated Arabic corpus

MS Al-Radhi, O Abdo, TG Csapó, S Abdou… - Computer Speech & …, 2020 - Elsevier

In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis by addressing two objectives. First, because the …

Cited by 13 - Related articles - All 2 versions

[PDF] isca-archive.org

[PDF] Independent Modelling of High and Low Energy Speech Frames for Spoofing Detection.

G Suthokumar, K Sriskandaraja, V Sethu… - Interspeech, 2017 - isca-archive.org

Spoofing detection systems for automatic speaker verification have moved from only
modelling voiced frames to modelling all speech frames. Unvoiced speech has been shown …

Cited by 17 - Related articles - All 6 versions