Computational intelligence in processing of speech acoustics: a survey

A Singh, N Kaur, V Kukreja, V Kadyan… - Complex & Intelligent …, 2022 - Springer
Speech recognition of a language is a key area in the field of pattern recognition. This paper
presents a comprehensive survey on the speech recognition techniques for non-Indian and …

[PDF][PDF] Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages.

HB Sailor, T Hain - Interspeech, 2020 - interspeech2020.org
This paper proposes a multilingual acoustic modeling approach for Indian languages using
a Multitask Learning (MTL) framework. Language-specific phoneme recognition is explored …

An exploration of semi-supervised and language-adversarial transfer learning using hybrid acoustic model for hindi speech recognition

A Kumar, RK Aggarwal - Journal of Reliable Intelligent Environments, 2022 - Springer
Semi-supervised training and language adversarial transfer learning are two different
techniques to improve the Automatic Speech Recognition (ASR) performance in limited …

A study of correlation between physiological process of articulation and emotions on Mandarin Chinese

Z Zhang, M Huang, Z Xiao - Speech Communication, 2023 - Elsevier
The goal of this work is to investigate the correlation between physiological process of
articulation and emotion expressing modes via speech of Mandarin Chinese. A bimodal …

[PDF][PDF] Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks.

MU Farooq, F Adeeba, S Rauf, S Hussain - INTERSPEECH, 2019 - isca-archive.org
Abstract Development of Large Vocabulary Continuous Speech Recognition (LVCSR)
system is a cumbersome task, especially for low resource languages. Urdu is the national …

An investigation of multilingual TDNN-BLSTM acoustic modeling for Hindi speech recognition

A Kumar, RK Aggarwal - International Journal of Sensors …, 2022 - ingentaconnect.com
Background: In India, thousands of languages or dialects are in use. Most Indian dialects are
low asset dialects. A well-performing Automatic Speech Recognition (ASR) system for Indian …

Articulatory-feature-based methods for performance improvement of Multilingual Phone Recognition Systems using Indian languages

KE Manjunath, DB Jayagopi, KS Rao… - Sādhanā, 2020 - Springer
In this work, the performance of Multilingual Phone Recognition System (Multi-PRS) is
improved using articulatory features (AFs). Four Indian languages–Kannada, Telugu …

Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN

MD Khan, R Ali, A Aziz - arXiv preprint arXiv:2307.12759, 2023 - arxiv.org
Call Centers have huge amount of audio data which can be used for achieving valuable
business insights and transcription of phone calls is manually tedious task. An effective …

End-to-end bengali speech recognition

S Mandal, S Yadav, A Rai - arXiv preprint arXiv:2009.09615, 2020 - arxiv.org
Bengali is a prominent language of the Indian subcontinent. However, while many state-of-
the-art acoustic models exist for prominent languages spoken in the region, research and …

Comparative Analysis for Speech Recognition Using DeepSpeech

C Singh, A Singh, H Upreti, S Bhatt… - 2024 International …, 2024 - ieeexplore.ieee.org
The presented work aims to implement speech recognition systems for Hindi and English
using advanced deep learning. Enhancing ASR technologies is crucial for improving digital …