[PDF][PDF] DNN-based ultrasound-to-speech conversion for a silent speech interface

TG Csapó, T Grósz, G Gosztolya, L Tóth, A Markó - 2017 - real.mtak.hu
In this paper we present our initial results in articulatory-toacoustic conversion based on
tongue movement recordings using Deep Neural Networks (DNNs). Despite the fact that …

Ultrasound-based articulatory-to-acoustic mapping with WaveGlow speech synthesis

TG Csapó, C Zainkó, L Tóth, G Gosztolya… - arXiv preprint arXiv …, 2020 - arxiv.org
For articulatory-to-acoustic mapping using deep neural networks, typically spectral and
excitation parameters of vocoders have been used as the training targets. However …

Investigations on speaker adaptation using a continuous vocoder within recurrent neural network based text-to-speech synthesis

AR Mandeel, MS Al-Radhi, TG Csapó - Multimedia Tools and Applications, 2023 - Springer
This paper presents an investigation of speaker adaptation using a continuous vocoder for
parametric text-to-speech (TTS) synthesis. In purposes that demand low computational …

Ultrasound-based silent speech interface built on a continuous vocoder

TG Csapó, MS Al-Radhi, G Németh… - arXiv preprint arXiv …, 2019 - arxiv.org
Recently it was shown that within the Silent Speech Interface (SSI) field, the prediction of F0
is possible from Ultrasound Tongue Images (UTI) as the articulatory input, using Deep …

[PDF][PDF] Time-Domain Envelope Modulating the Noise Component of Excitation in a Continuous Residual-Based Vocoder for Statistical Parametric Speech Synthesis.

MS Al-Radhi, TG Csapó, G Németh - Interspeech, 2017 - isca-archive.org
In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis. Previous work has shown the advantages of adding …

Towards implementing a software tester for benchmarking MAP-T devices

A Al-hamadani, G Lencse - Infocommunications Journal, 2022 - real.mtak.hu
Several IPv6 transition technologies have been designed and developed over the past few
years to accelerate the full adoption of the IPv6 address pool. To make things more …

Speaker adaptation experiments with limited data for end-to-end text-to-speech synthesis using tacotron2

AR Mandeel, MS Al-Radhi, TG Csapó - Infocommunications journal, 2022 - real.mtak.hu
Speech synthesis has the aim of generating humanlike speech from text. Nowadays, with
end-to-end systems, highly natural synthesized speech can be achieved if a large enough …

Speaker Adaptation with Continuous Vocoder-Based DNN-TTS

AR Mandeel, MS Al-Radhi, TG Csapó - Speech and Computer: 23rd …, 2021 - Springer
Traditional vocoder-based statistical parametric speech synthesis can be advantageous in
applications that require low computational complexity. Recent neural vocoders, which can …

A continuous vocoder for statistical parametric speech synthesis and its evaluation using an audio-visual phonetically annotated Arabic corpus

MS Al-Radhi, O Abdo, TG Csapó, S Abdou… - Computer Speech & …, 2020 - Elsevier
In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis by addressing two objectives. First, because the …

[PDF][PDF] Independent Modelling of High and Low Energy Speech Frames for Spoofing Detection.

G Suthokumar, K Sriskandaraja, V Sethu… - Interspeech, 2017 - isca-archive.org
Spoofing detection systems for automatic speaker verification have moved from only
modelling voiced frames to modelling all speech frames. Unvoiced speech has been shown …