Statistical parametric speech synthesis

H Zen, K Tokuda, AW Black - speech communication, 2009 - Elsevier
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …

Open-source high quality speech datasets for Basque, Catalan and Galician

O Kjartansson, A Gutkin, A Butryna… - Proceedings of the …, 2020 - aclanthology.org
This paper introduces new open speech datasets for three of the languages of Spain:
Basque, Catalan and Galician. Catalan is furthermore the official language of the Principality …

Language and noise transfer in speech enhancement generative adversarial network

S Pascual, M Park, J Serrà… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
Speech enhancement deep learning systems usually require large amounts of training data
to operate in broad conditions or real applications. This makes the adaptability of those …

Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence

J Adell, D Escudero, A Bonafonte - Speech Communication, 2012 - Elsevier
Until now, speech synthesis has mainly involved reading-style speech. Today, however, text-
to-speech systems must provide a variety of styles because users expect these interfaces to …

Analysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system

D Escudero, L Aguilar, M del Mar Vanrell… - Speech …, 2012 - Elsevier
A set of tools to analyze inconsistencies observed in a Cat_ToBI labeling experiment are
presented. We formalize and use the metrics that are commonly used in inconsistency tests …

Phonetic inventory for an Arabic speech corpus

N Halabi, M Wald - 2016 - eprints.soton.ac.uk
Corpus design for speech synthesis is a well-researched topic in languages such as English
compared to Modern Standard Arabic, and there is a tendency to focus on methods to …

[PDF][PDF] Personalized synthetic voices for speaking impaired: website and app.

D Erro, I Hernaez, A Alonso, D García-Lorenzo… - …, 2015 - isca-archive.org
This paper describes the current state of the work that is being carried out in the framework
of the ZureTTS project to give a personalized voice to people who cannot speak in their own …

CATOTRON–a neural text-to-speech system in Catalan

B Külebi, A Öktem, À Peiró Lilja… - … of Interspeech 2020; …, 2020 - repositori.upf.edu
We present Catotron, a neural network-based open-source speech synthesis system in
Catalan. Catotron consists of a sequence-to-sequence model trained with two small …

Arabic speech corpus

N Halabi - Oxford Text Archive Core Collection, 2016 - llds.phon.ox.ac.uk
The resource is a speech corpus, with digital audio files, text transcripts, and files containing
time stamps of the phoneme boundaries. There are 1813. wav files containing spoken …

[PDF][PDF] ZureTTS: Online platform for obtaining personalized synthetic voices

D Erro, I Hernáez, E Navas, A Alonso, H Arzelus… - Proc …, 2014 - aholab.ehu.eus
The primary goal of the ZureTTS project was the design and development of a web interface
that allows nonexpert users to get their own personalized synthetic voice with minimal effort …