Statistical parametric speech synthesis
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …
Open-source high quality speech datasets for Basque, Catalan and Galician
O Kjartansson, A Gutkin, A Butryna… - Proceedings of the …, 2020 - aclanthology.org
This paper introduces new open speech datasets for three of the languages of Spain:
Basque, Catalan and Galician. Catalan is furthermore the official language of the Principality …
Language and noise transfer in speech enhancement generative adversarial network
Speech enhancement deep learning systems usually require large amounts of training data
to operate in broad conditions or real applications. This makes the adaptability of those …
Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence
Until now, speech synthesis has mainly involved reading-style speech. Today, however, text-
to-speech systems must provide a variety of styles because users expect these interfaces to …
Analysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system
A set of tools to analyze inconsistencies observed in a Cat_ToBI labeling experiment are
presented. We formalize and use the metrics that are commonly used in inconsistency tests …
Phonetic inventory for an Arabic speech corpus
N Halabi, M Wald - 2016 - eprints.soton.ac.uk
Corpus design for speech synthesis is a well-researched topic in languages such as English
compared to Modern Standard Arabic, and there is a tendency to focus on methods to …
Personalized synthetic voices for speaking impaired: website and app.
This paper describes the current state of the work that is being carried out in the framework
of the ZureTTS project to give a personalized voice to people who cannot speak in their own …
CATOTRON–a neural text-to-speech system in Catalan
B Külebi, A Öktem, À Peiró Lilja… - … of Interspeech 2020; …, 2020 - repositori.upf.edu
We present Catotron, a neural network-based open-source speech synthesis system in
Catalan. Catotron consists of a sequence-to-sequence model trained with two small …
Arabic speech corpus
N Halabi - Oxford Text Archive Core Collection, 2016 - llds.phon.ox.ac.uk
The resource is a speech corpus, with digital audio files, text transcripts, and files containing
time stamps of the phoneme boundaries. There are 1813 .wav files containing spoken …
ZureTTS: Online platform for obtaining personalized synthetic voices
The primary goal of the ZureTTS project was the design and development of a web interface
that allows nonexpert users to get their own personalized synthetic voice with minimal effort …