[HTML][HTML] Automatic Speech Recognition: A survey of deep learning techniques and approaches

H Ahlawat, N Aggarwal, D Gupta - International Journal of Cognitive …, 2025 - Elsevier
Significant research has been conducted during the last decade on the application of
machine learning for speech processing, particularly speech recognition. However, in recent …

Multilingual speech corpus in low-resource eastern and northeastern Indian languages for speaker and language identification

J Basu, S Khan, R Roy, TK Basu… - Circuits, Systems, and …, 2021 - Springer
Research and development of speech technology applications in low-resource languages
(LRL) are challenging due to the non-availability of proper speech corpus. Especially, for …

Annotated speech corpus for low resource Indian languages: Awadhi, Bhojpuri, Braj and Magahi

R Kumar, S Singh, S Ratan, M Raj, S Sinha… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper we discuss an in-progress work on the development of a speech corpus for four
low-resource Indo-Aryan languages--Awadhi, Bhojpuri, Braj and Magahi using the field …

CoRePooL—Corpus for Resource‐Poor Languages: Badaga Speech Corpus

HB Barathi Ganesh, G Jyothish Lal… - … and Translation for …, 2024 - Wiley Online Library
This chapter presents a corpus named CoRePooL that stands for Corpus for Resource‐Poor
Languages. As voice‐specific human‐machine interaction applications are accelerated by …

A Novel Approach for Bootstrapping and Automatic Transcription of Low Resourced Language Speech Corpus

MK Roy, KK Arora, J Basu, S Basu… - … 26th Conference of …, 2023 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) systems have made significant advancements in the
context of high-resource languages, primarily attributable to the abundant availability of …

Voistutor 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners

P Pal, C Yarra, PK Ghosh - 2022 25th Conference of the …, 2022 - ieeexplore.ieee.org
In computer assisted pronunciation training (CAPT), robust automatic models are critical for
pronunciation assessment and mispronunciation detection and diagnosis (MDD). In the …

Data Collection and Development of Bengali ASR and TTS for Conversational AI-based Automated Advisories in the Agriculture domain

S Khan, T Basu, J Basu, M Pal, R Roy… - 2022 4th International …, 2022 - ieeexplore.ieee.org
This paper presents an indigenous work of text and speech data collection and organization
in the agriculture domain and developing a structured Agriculture Knowledge Repository …

[PDF][PDF] Collecting Speech Data for Endangered and Under-resourced Indian Languages

R Kumar, M Takhellambam, B Lahiri… - Proc. 2nd Annual …, 2023 - sigul-2023.ilc.cnr.it
The preparation of speech corpora for languages un (der) represented on the web largely
depends on the manual methods of data collection and processing from different sources …

[PDF][PDF] Building Speech Corpus in Rapid Manner to Adapt a General Purpose ASR System to Specific Domain

MK Roy, S Arora, K Arora, SS Agarwal - 2021 - easychair.org
The situation prevalent due to Covid-19 has affected the traditional speech database
collection process by reaching out persons in one-to-one manner. In this paper, we describe …