Transfer learning for speech and language processing

D Wang, TF Zheng - 2015 Asia-Pacific Signal and Information …, 2015 - ieeexplore.ieee.org
Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …

Unsupervised speech representation learning using wavenet autoencoders

J Chorowski, RJ Weiss, S Bengio… - … /ACM transactions on …, 2019 - ieeexplore.ieee.org
We consider the task of unsupervised extraction of meaningful latent representations of
speech by applying autoencoding neural networks to speech waveforms. The goal is to …

Unsupervised pretraining transfers well across languages

M Riviere, A Joulin, PE Mazaré… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Cross-lingual and multi-lingual training of Automatic Speech Recognition (ASR) has been
extensively investigated in the supervised setting. This assumes the existence of a parallel …

A survey on automatic speech recognition systems for Portuguese language and its variations

TA de Lima, M Da Costa-Abreu - Computer Speech & Language, 2020 - Elsevier
Communication has been an essential part of being human and living in society. There are
several different languages and variations of them, so you can speak English in one place …

Automatic speech recognition for under-resourced languages: A survey

L Besacier, E Barnard, A Karpov, T Schultz - Speech communication, 2014 - Elsevier
Speech processing for under-resourced languages is an active field of research, which has
experienced significant progress during the past decade. We propose, in this paper, a …

Libri-light: A benchmark for asr with limited or no supervision

J Kahn, M Riviere, W Zheng… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
We introduce a new collection of spoken English audio suitable for training speech
recognition systems under limited or no supervision. It is derived from open-source audio …

Language independent end-to-end architecture for joint language identification and speech recognition

S Watanabe, T Hori, JR Hershey - 2017 IEEE Automatic Speech …, 2017 - ieeexplore.ieee.org
End-to-end automatic speech recognition (ASR) can significantly reduce the burden of
developing ASR systems for new languages, by eliminating the need for linguistic …

Multilingual end-to-end speech translation

H Inaguma, K Duh, T Kawahara… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org
In this paper, we propose a simple yet effective framework for multilingual end-to-end
speech translation (ST), in which speech utterances in source languages are directly …

Data augmentation for low resource languages

A Ragni, KM Knill, SP Rath… - … 2014: 15th annual …, 2014 - eprints.whiterose.ac.uk
Recently there has been interest in the approaches for training speech recognition systems
for languages with limited resources. Under the IARPA Babel program such resources have …

Sequence-based multi-lingual low resource speech recognition

S Dalmia, R Sanabria, F Metze… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Techniques for multi-lingual and cross-lingual speech recognition can help in low resource
scenarios, to bootstrap systems and enable analysis of new languages and domains. End-to …