Learning lexicons from speech using a pronunciation mixture model

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

被引用次数：80 相关文章所有 6 个版本

[PDF] nowpublishers.com

Deep learning: methods and applications

L Deng, D Yu - Foundations and trends® in signal processing, 2014 - nowpublishers.com

This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

被引用次数：5837 相关文章所有 13 个版本

[HTML] mdpi.com

[HTML][HTML] Frontier Research on Low-Resource Speech Recognition Technology

W Slam, Y Li, N Urouvas - Sensors, 2023 - mdpi.com

With the development of continuous speech recognition technology, users have put forward
higher requirements in terms of speech recognition accuracy. Low-resource speech …

被引用次数：2 相关文章所有 8 个版本

[PDF] microsoft.com

Foundations and trends in signal processing: Deep learning–methods and applications

L Deng, D Yu - 2014 - microsoft.com

This monograph provides an overview of general deep learning methodology and its
applications to a variety of signal and information processing tasks. The application areas …

被引用次数：40 相关文章所有 7 个版本

Multi-accent speech recognition with hierarchical grapheme based models

K Rao, H Sak - … conference on acoustics, speech and signal …, 2017 - ieeexplore.ieee.org

We train grapheme-based acoustic models for speech recognition using a hierarchical
recurrent neural network architecture with connectionist temporal classification (CTC) loss …

被引用次数：86 相关文章所有 4 个版本

Computer multimedia assisted language and literature teaching using Heuristic hidden Markov model and statistical language model

J Zhang, C Wang, A Muthu, VM Varatharaju - Computers & Electrical …, 2022 - Elsevier

Computer technology has been used for decades in secondary education and foreign
language preparation. Still, attempts to incorporate technology have presented educators …

被引用次数：20 相关文章所有 2 个版本

[PDF] arxiv.org

No need for a lexicon? evaluating the value of the pronunciation lexica in end-to-end models

TN Sainath, R Prabhavalkar, S Kumar… - … , Speech and Signal …, 2018 - ieeexplore.ieee.org

For decades, context-dependent phonemes have been the dominant sub-word unit for
conventional acoustic modeling systems. This status quo has begun to be challenged …

被引用次数：52 相关文章所有 8 个版本

[PDF] arxiv.org

Diacritic recognition performance in arabic asr

H Aldarmaki, A Ghannam - arXiv preprint arXiv:2302.14022, 2023 - arxiv.org

We present an analysis of diacritic recognition performance in Arabic Automatic Speech
Recognition (ASR) systems. As most existing Arabic speech corpora do not contain all …

被引用次数：7 相关文章所有 7 个版本

Improved Arabic speech recognition system through the automatic generation of fine-grained phonetic transcriptions

E Alsharhan, A Ramsay - Information Processing & Management, 2019 - Elsevier

This paper aims at determining the best way to exploit the phonological properties of the
Arabic language in order to improve the performance of the speech recognition system. One …

被引用次数：36 相关文章所有 2 个版本

[PDF] uchicago.edu

Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition

L Lu, A Ghoshal, S Renals - 2013 IEEE Workshop on Automatic …, 2013 - ieeexplore.ieee.org

Speech recognition systems normally use handcrafted pronunciation lexicons designed by
linguistic experts. Building and maintaining such a lexicon is expensive and time …

被引用次数：55 相关文章所有 9 个版本