Investigation on cross-and multilingual MLP features under matched and mismatched acoustical...

D Wang, TF Zheng - 2015 Asia-Pacific Signal and Information …, 2015 - ieeexplore.ieee.org

Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …

被引用次数：263 相关文章所有 12 个版本

[PDF] ieee.org

End-to-end speech recognition: A survey

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

被引用次数：158 相关文章所有 6 个版本

[PDF] arxiv.org

Large-scale multilingual speech recognition with a streaming end-to-end model

A Kannan, A Datta, TN Sainath, E Weinstein… - arXiv preprint arXiv …, 2019 - arxiv.org

Multilingual end-to-end (E2E) models have shown great promise in expansion of automatic
speech recognition (ASR) coverage of the world's languages. They have shown …

被引用次数：197 相关文章所有 9 个版本

[PDF] arxiv.org

Multilingual speech recognition with a single end-to-end model

S Toshniwal, TN Sainath, RJ Weiss, B Li… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Training a conventional automatic speech recognition (ASR) system to support multiple
languages is challenging because the sub-word unit, lexicon and word inventories are …

被引用次数：303 相关文章所有 12 个版本

[PDF] mdpi.com

Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview

W Du, Y Maimaitiyiming, M Nijat, L Li, A Hamdulla… - Applied Sciences, 2022 - mdpi.com

With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …

被引用次数：14 相关文章所有 3 个版本

[PDF] arxiv.org

Very deep multilingual convolutional neural networks for LVCSR

T Sercu, C Puhrsch, B Kingsbury… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org

Convolutional neural networks (CNNs) are a standard component of many current state-of-
the-art Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However …

被引用次数：279 相关文章所有 9 个版本

[PDF] whiterose.ac.uk

Speech recognition and keyword spotting for low-resource languages: Babel project research at cued

MJF Gales, KM Knill, A Ragni… - … workshop on spoken …, 2014 - eprints.whiterose.ac.uk

Recently there has been increased interest in Automatic Speech Recognition (ASR) and
Key Word Spotting (KWS) systems for low resource languages. One of the driving forces for …

被引用次数：224 相关文章所有 10 个版本

[PDF] whiterose.ac.uk

Data augmentation for low resource languages

A Ragni, KM Knill, SP Rath… - … 2014: 15th annual …, 2014 - eprints.whiterose.ac.uk

Recently there has been interest in the approaches for training speech recognition systems
for languages with limited resources. Under the IARPA Babel program such resources have …

被引用次数：179 相关文章所有 12 个版本

[PDF] arxiv.org

A survey of multilingual models for automatic speech recognition

H Yadav, S Sitaram - arXiv preprint arXiv:2202.12576, 2022 - arxiv.org

Although Automatic Speech Recognition (ASR) systems have achieved human-like
performance for a few languages, the majority of the world's languages do not have usable …

被引用次数：46 相关文章所有 4 个版本

[PDF] epfl.ch

Multilingual deep neural network based acoustic modeling for rapid language adaptation

NT Vu, D Imseng, D Povey, P Motlicek… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org

This paper presents a study on multilingual deep neural network (DNN) based acoustic
modeling and its application to new languages. We investigate the effect of phone merging …

被引用次数：148 相关文章所有 11 个版本