Transfer learning for speech and language processing
Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …
to other settings or tasks. For example in speech recognition, an acoustic model trained for …
End-to-end speech recognition: A survey
In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …
learning has brought considerable reductions in word error rate of more than 50% relative …
Large-scale multilingual speech recognition with a streaming end-to-end model
Multilingual end-to-end (E2E) models have shown great promise in expansion of automatic
speech recognition (ASR) coverage of the world's languages. They have shown …
speech recognition (ASR) coverage of the world's languages. They have shown …
Multilingual speech recognition with a single end-to-end model
Training a conventional automatic speech recognition (ASR) system to support multiple
languages is challenging because the sub-word unit, lexicon and word inventories are …
languages is challenging because the sub-word unit, lexicon and word inventories are …
Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …
Very deep multilingual convolutional neural networks for LVCSR
Convolutional neural networks (CNNs) are a standard component of many current state-of-
the-art Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However …
the-art Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However …
Speech recognition and keyword spotting for low-resource languages: Babel project research at cued
Recently there has been increased interest in Automatic Speech Recognition (ASR) and
Key Word Spotting (KWS) systems for low resource languages. One of the driving forces for …
Key Word Spotting (KWS) systems for low resource languages. One of the driving forces for …
Data augmentation for low resource languages
Recently there has been interest in the approaches for training speech recognition systems
for languages with limited resources. Under the IARPA Babel program such resources have …
for languages with limited resources. Under the IARPA Babel program such resources have …
A survey of multilingual models for automatic speech recognition
Although Automatic Speech Recognition (ASR) systems have achieved human-like
performance for a few languages, the majority of the world's languages do not have usable …
performance for a few languages, the majority of the world's languages do not have usable …
Multilingual deep neural network based acoustic modeling for rapid language adaptation
This paper presents a study on multilingual deep neural network (DNN) based acoustic
modeling and its application to new languages. We investigate the effect of phone merging …
modeling and its application to new languages. We investigate the effect of phone merging …