Synthetic data augmentation for improving low-resource ASR

B Thai, R Jimerson, D Arcoraci… - 2019 IEEE Western …, 2019 - ieeexplore.ieee.org
Although the application of deep learning to automatic speech recognition (ASR) has
resulted in dramatic reductions in word error rate for languages with abundant training data …

Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages.

HB Sailor, T Hain - Interspeech, 2020 - interspeech2020.org
This paper proposes a multilingual acoustic modeling approach for Indian languages using
a Multitask Learning (MTL) framework. Language-specific phoneme recognition is explored …

The multi-domain international search on speech 2020 ALBAYZIN evaluation: Overview, systems, results, discussion and post-evaluation analyses

J Tejedor, DT Toledano, JM Ramirez, AR Montalvo… - Applied Sciences, 2021 - mdpi.com
The large amount of information stored in audio and video repositories makes search on
speech (SoS) a challenging area that is continuously receiving much interest. Within SoS …

One size does not fit all in resource-constrained ASR

E Morris, R Jimerson, E Prud'hommeaux - … of INTERSPEECH 2021, 2021 - par.nsf.gov
The application of deep neural networks to the task of acoustic modeling for automatic
speech recognition has resulted in dramatic decreases in ASR word error rates, enabling …

From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language

M Sharif, Z Abbas, J Yi, C Liu - arXiv preprint arXiv:2411.14493, 2024 - arxiv.org
Automatic Speech Recognition (ASR) technology has witnessed significant advancements
in recent years, revolutionizing human-computer interactions. While major languages have …

Whisper Finetuning on Nepali Language

S Rijal, S Adhikari, M Dahal, M Awale… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite the growing advancements in Automatic Speech Recognition (ASR) models, the
development of robust models for underrepresented languages, such as Nepali, remains a …

[Book][B] Automatic speech recognition for low-resource and morphologically complex languages

E Morris - 2021 - search.proquest.com
The application of deep neural networks to the task of acoustic modeling for automatic
speech recognition (ASR) has resulted in dramatic decreases of word error rates, allowing …

4 Speech Recognition for Persian

M Vafaie, J Dehdari - Persian Computational Linguistics and NLP, 2023 - degruyter.com
Automatic Speech Recognition (ASR) is a cross-disciplinary field that enables computers to process human speech i…
meaning of it with the help of other NLP technologies such as natural language …

Data Selection Based on Phoneme Affinity Matrix for Electrolarynx Speech Recognition

IT Hsieh, CH Wu, SW Tsa - 2023 Asia Pacific Signal and …, 2023 - ieeexplore.ieee.org
Electrolarynx (EL) is a communicative aid for the patient after laryngectomy to generate
communicable speech. Since EL speech exhibits low speech intelligibility and produces …

Motivations, challenges, and perspectives for the development of an Automatic Speech Recognition System for the under-resourced Ngiemboon Language

P Yemmene, L Besacier - … of the First International Workshop on …, 2019 - aclanthology.org
Nowadays, a broad range of speech recognition technologies (such as Apple Siri and
Amazon Alexa) are developed as the user interface has become ever convenient and …