Transfer learning for speech and language processing

D Wang, TF Zheng - 2015 Asia-Pacific Signal and Information …, 2015 - ieeexplore.ieee.org
Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …

End-to-end speech recognition: A survey

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

Large-scale multilingual speech recognition with a streaming end-to-end model

A Kannan, A Datta, TN Sainath, E Weinstein… - arXiv preprint arXiv …, 2019 - arxiv.org
Multilingual end-to-end (E2E) models have shown great promise in expansion of automatic
speech recognition (ASR) coverage of the world's languages. They have shown …

Multilingual speech recognition with a single end-to-end model

S Toshniwal, TN Sainath, RJ Weiss, B Li… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
Training a conventional automatic speech recognition (ASR) system to support multiple
languages is challenging because the sub-word unit, lexicon and word inventories are …

Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview

W Du, Y Maimaitiyiming, M Nijat, L Li, A Hamdulla… - Applied Sciences, 2022 - mdpi.com
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …

Very deep multilingual convolutional neural networks for LVCSR

T Sercu, C Puhrsch, B Kingsbury… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) are a standard component of many current state-of-
the-art Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However …

Speech recognition and keyword spotting for low-resource languages: Babel project research at cued

MJF Gales, KM Knill, A Ragni… - … workshop on spoken …, 2014 - eprints.whiterose.ac.uk
Recently there has been increased interest in Automatic Speech Recognition (ASR) and
Key Word Spotting (KWS) systems for low resource languages. One of the driving forces for …

Data augmentation for low resource languages

A Ragni, KM Knill, SP Rath… - … 2014: 15th annual …, 2014 - eprints.whiterose.ac.uk
Recently there has been interest in the approaches for training speech recognition systems
for languages with limited resources. Under the IARPA Babel program such resources have …

A survey of multilingual models for automatic speech recognition

H Yadav, S Sitaram - arXiv preprint arXiv:2202.12576, 2022 - arxiv.org
Although Automatic Speech Recognition (ASR) systems have achieved human-like
performance for a few languages, the majority of the world's languages do not have usable …

Multilingual deep neural network based acoustic modeling for rapid language adaptation

NT Vu, D Imseng, D Povey, P Motlicek… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org
This paper presents a study on multilingual deep neural network (DNN) based acoustic
modeling and its application to new languages. We investigate the effect of phone merging …