Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

Q Shao, P Guo, J Yan, P Hu… - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
Accents pose significant challenges for speech recognition systems. Although joint
automatic speech recognition (ASR) and accent recognition (AR) training has been proven …

[PDF][PDF] Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis

T Bhattacharjee, A Jayakumar, Y Belur… - Proc …, 2023 - isca-archive.org
A major challenge involved in automatic dysarthria severity classification for patients with
Amyotrophic Lateral Sclerosis (ALS) is the difficulty to build a speech corpus which is large …

Non-native english lexicon creation for bilingual speech synthesis

A Baby, P Jawale, S Vinnaitherthan, S Badam… - arXiv preprint arXiv …, 2021 - arxiv.org
Bilingual English speakers speak English as one of their languages. Their English is of a
non-native kind, and their conversations are of a code-mixed fashion. The intelligibility of a …

SPIRE-SIES: A Spontaneous Indian English Speech Corpus

A Singh, C Shah, R Varadaraj… - … 26th Conference of …, 2023 - ieeexplore.ieee.org
In this paper, we present a 170.83 hour Indian English spontaneous speech dataset. Lack of
Indian English speech data is one of the major hindrances in developing robust speech …

[PDF][PDF] Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss

J Bandekar, S Udupa, PK Ghosh - Proc. Interspeech 2024, 2024 - isca-archive.org
Articulatory speech synthesis is a challenging task which requires mapping of time-varying
articulatory trajectories and speech. In recent years, deep learning methods have been …

An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations

S Jain, P Pal, A Vuppala, P Ghosh, C Yarra - arXiv preprint arXiv …, 2022 - arxiv.org
Speech systems are sensitive to accent variations. This is especially challenging in the
Indian context, with an abundance of languages but a dearth of linguistic studies …

Pseudo likelihood correction technique for low resource accented asr

A Rajpal, A Rao, C Yarra, R Aggarwal… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
With the availability of large data, ASRs perform well on native English but poorly for non-
native English data. Training nonnative ASRs or adapting a native English ASR is often …

IE-CPS Lexicon: An automatic speech recognition oriented Indian-English pronunciation dictionary

S Jain, A Yadavalli, G Mirishkar, C Yarra… - Proceedings of the …, 2021 - aclanthology.org
Indian English (IE), on the surface, seems quite similar to standard English. However, closer
observation shows that it has actually been influenced by the surrounding vernacular …

G-Cocktail: An Algorithm to Address Cocktail Party Problem of Gujarati Language Using Cat Boost

M Gupta, RK Singh, S Singh - Wireless Personal Communications, 2022 - Springer
This paper is an attempt to address to the problem of native language in a mixed voice
environment. G-Cocktail would aid these applications in identifying commands given in …

Study of Indian English pronunciation variabilities relative to Received Pronunciation

P Pal, S Jain, C Yarra, PK Ghosh… - … Conference on Speech …, 2023 - Springer
Abstract Analysis of Indian English (IE) pronunciation variabilities is useful in ASR and TTS
modelling for the Indian context. Prior works characterised IE variabilities by reporting …