[HTML][HTML] Automatic Speech Recognition: A survey of deep learning techniques and approaches
H Ahlawat, N Aggarwal, D Gupta - International Journal of Cognitive …, 2025 - Elsevier
Significant research has been conducted during the last decade on the application of
machine learning for speech processing, particularly speech recognition. However, in recent …
machine learning for speech processing, particularly speech recognition. However, in recent …
Multilingual speech corpus in low-resource eastern and northeastern Indian languages for speaker and language identification
Research and development of speech technology applications in low-resource languages
(LRL) are challenging due to the non-availability of proper speech corpus. Especially, for …
(LRL) are challenging due to the non-availability of proper speech corpus. Especially, for …
Annotated speech corpus for low resource Indian languages: Awadhi, Bhojpuri, Braj and Magahi
R Kumar, S Singh, S Ratan, M Raj, S Sinha… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper we discuss an in-progress work on the development of a speech corpus for four
low-resource Indo-Aryan languages--Awadhi, Bhojpuri, Braj and Magahi using the field …
low-resource Indo-Aryan languages--Awadhi, Bhojpuri, Braj and Magahi using the field …
CoRePooL—Corpus for Resource‐Poor Languages: Badaga Speech Corpus
HB Barathi Ganesh, G Jyothish Lal… - … and Translation for …, 2024 - Wiley Online Library
This chapter presents a corpus named CoRePooL that stands for Corpus for Resource‐Poor
Languages. As voice‐specific human‐machine interaction applications are accelerated by …
Languages. As voice‐specific human‐machine interaction applications are accelerated by …
A Novel Approach for Bootstrapping and Automatic Transcription of Low Resourced Language Speech Corpus
Automatic Speech Recognition (ASR) systems have made significant advancements in the
context of high-resource languages, primarily attributable to the abundant availability of …
context of high-resource languages, primarily attributable to the abundant availability of …
Voistutor 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners
In computer assisted pronunciation training (CAPT), robust automatic models are critical for
pronunciation assessment and mispronunciation detection and diagnosis (MDD). In the …
pronunciation assessment and mispronunciation detection and diagnosis (MDD). In the …
Data Collection and Development of Bengali ASR and TTS for Conversational AI-based Automated Advisories in the Agriculture domain
This paper presents an indigenous work of text and speech data collection and organization
in the agriculture domain and developing a structured Agriculture Knowledge Repository …
in the agriculture domain and developing a structured Agriculture Knowledge Repository …
[PDF][PDF] Collecting Speech Data for Endangered and Under-resourced Indian Languages
R Kumar, M Takhellambam, B Lahiri… - Proc. 2nd Annual …, 2023 - sigul-2023.ilc.cnr.it
The preparation of speech corpora for languages un (der) represented on the web largely
depends on the manual methods of data collection and processing from different sources …
depends on the manual methods of data collection and processing from different sources …
[PDF][PDF] Building Speech Corpus in Rapid Manner to Adapt a General Purpose ASR System to Specific Domain
The situation prevalent due to Covid-19 has affected the traditional speech database
collection process by reaching out persons in one-to-one manner. In this paper, we describe …
collection process by reaching out persons in one-to-one manner. In this paper, we describe …