Augmented datasheets for speech datasets and ethical decision-making

O Papakyriakopoulos, ASG Choi, W Thong… - Proceedings of the …, 2023 - dl.acm.org
Speech datasets are crucial for training Speech Language Technologies (SLT); however,
the lack of diversity of the underlying training data can lead to serious limitations in building …

[PDF][PDF] Siri on-device deep learning-guided unit selection text-to-speech system.

T Capes, P Coles, A Conkie, L Golipour… - Interspeech, 2017 - academia.edu
This paper describes Apple's hybrid unit selection speech synthesis system, which provides
the voices for Siri with the requirement of naturalness, personality and expressivity. It has …

[PDF][PDF] The reference corpus of the contemporary Romanian language (CoRoLa)

VB Mititelu, D Tufiş, E Irimia - Proceedings of the Eleventh …, 2018 - aclanthology.org
We present here the largest publicly available corpus of Romanian. Its written component
contains 1,257,752,812 tokens, distributed, in an unbalanced way, in several language …

RSC: A Romanian read speech corpus for automatic speech recognition

AL Georgescu, H Cucu, A Buzo… - Proceedings of the …, 2020 - aclanthology.org
Although many efforts have been made in the last decade to enhance the speech and
language resources for Romanian, this language is still considered under-resourced. While …

A processing platform relating data and tools for Romanian language

V Păiș, R Ion, D Tufiş - … of the 1st International Workshop on …, 2020 - aclanthology.org
This paper presents RELATE (http://relate. racai. ro), a high-performance natural language
platform designed for Romanian language. It is meant both for demonstration of available …

TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

A Stan, O Watts, Y Mamiya, M Giurgiu… - … 2013, 14th Annual …, 2013 - research.ed.ac.uk
Abstract Simple4All Tundra (version 1.0) is the first release of a standardised multilingual
corpus designed for text-to-speech research with imperfect or found data. The corpus …

Unsupervised learning for text-to-speech synthesis

OS Watts - 2013 - era.ed.ac.uk
This thesis introduces a general method for incorporating the distributional analysis of
textual and linguistic objects into text-to-speech (TTS) conversion systems. Conventional …

[PDF][PDF] Towards a romanian end-to-end automatic speech recognition based on deepspeech2

AM Avram, P Vasile, D Tufis - Proc. Rom. Acad. Ser. A, 2020 - academia.edu
This paper presents an implementation of an ASR system for the Romanian language that
uses a multi-layer neural network architecture to transcribe the input speech, augmented …

The SWARA speech corpus: A large parallel Romanian read speech dataset

A Stan, F Dinescu, C Ţiple, Ş Meza… - 2017 International …, 2017 - ieeexplore.ieee.org
This paper introduces one of the largest Romanian speech datasets freely available for both
academic and commercial use. The dataset comprises speech data recorded over the last …

[HTML][HTML] Improving post-filtering of artificial speech using pre-trained LSTM neural networks

M Coto-Jiménez - Biomimetics, 2019 - mdpi.com
Several researchers have contemplated deep learning-based post-filters to increase the
quality of statistical parametric speech synthesis, which perform a mapping of the synthetic …