[PDF][PDF] DNN-based ultrasound-to-speech conversion for a silent speech interface
In this paper we present our initial results in articulatory-toacoustic conversion based on
tongue movement recordings using Deep Neural Networks (DNNs). Despite the fact that …
tongue movement recordings using Deep Neural Networks (DNNs). Despite the fact that …
Ultrasound-based articulatory-to-acoustic mapping with WaveGlow speech synthesis
For articulatory-to-acoustic mapping using deep neural networks, typically spectral and
excitation parameters of vocoders have been used as the training targets. However …
excitation parameters of vocoders have been used as the training targets. However …
Investigations on speaker adaptation using a continuous vocoder within recurrent neural network based text-to-speech synthesis
This paper presents an investigation of speaker adaptation using a continuous vocoder for
parametric text-to-speech (TTS) synthesis. In purposes that demand low computational …
parametric text-to-speech (TTS) synthesis. In purposes that demand low computational …
Ultrasound-based silent speech interface built on a continuous vocoder
Recently it was shown that within the Silent Speech Interface (SSI) field, the prediction of F0
is possible from Ultrasound Tongue Images (UTI) as the articulatory input, using Deep …
is possible from Ultrasound Tongue Images (UTI) as the articulatory input, using Deep …
[PDF][PDF] Time-Domain Envelope Modulating the Noise Component of Excitation in a Continuous Residual-Based Vocoder for Statistical Parametric Speech Synthesis.
In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis. Previous work has shown the advantages of adding …
statistical parametric speech synthesis. Previous work has shown the advantages of adding …
Towards implementing a software tester for benchmarking MAP-T devices
A Al-hamadani, G Lencse - Infocommunications Journal, 2022 - real.mtak.hu
Several IPv6 transition technologies have been designed and developed over the past few
years to accelerate the full adoption of the IPv6 address pool. To make things more …
years to accelerate the full adoption of the IPv6 address pool. To make things more …
Speaker adaptation experiments with limited data for end-to-end text-to-speech synthesis using tacotron2
Speech synthesis has the aim of generating humanlike speech from text. Nowadays, with
end-to-end systems, highly natural synthesized speech can be achieved if a large enough …
end-to-end systems, highly natural synthesized speech can be achieved if a large enough …
Speaker Adaptation with Continuous Vocoder-Based DNN-TTS
Traditional vocoder-based statistical parametric speech synthesis can be advantageous in
applications that require low computational complexity. Recent neural vocoders, which can …
applications that require low computational complexity. Recent neural vocoders, which can …
A continuous vocoder for statistical parametric speech synthesis and its evaluation using an audio-visual phonetically annotated Arabic corpus
In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis by addressing two objectives. First, because the …
statistical parametric speech synthesis by addressing two objectives. First, because the …
[PDF][PDF] Independent Modelling of High and Low Energy Speech Frames for Spoofing Detection.
Spoofing detection systems for automatic speaker verification have moved from only
modelling voiced frames to modelling all speech frames. Unvoiced speech has been shown …
modelling voiced frames to modelling all speech frames. Unvoiced speech has been shown …