Effect of pitch enhancement in Punjabi children's speech recognition system under disparate acoustic conditions

V Bhardwaj, V Kukreja - Applied Acoustics, 2021 - Elsevier
In this work, a Punjabi children speech recognition system is developed under different
acoustic matched and mismatched conditions. One major problem in children's speech …

Significance of vowel-like regions for speaker verification under degraded conditions

SRM Prasanna, G Pradhan - IEEE transactions on audio …, 2011 - ieeexplore.ieee.org
Vowel-like regions (VLRs) in speech includes vowels, semi-vowels, and diphthong sound
units. VLR can be identified using a vowel-like region onset point (VLROP) event. By …

[图书][B] Speech processing in mobile environments

KS Rao, AK Vuppala - 2014 - Springer
Robust speech systems in mobile environment have gained a special interest in recent
years in order to enable access to remote voice-activated services. In this context, three …

Foreground speech segmentation and enhancement using glottal closure instants and mel cepstral coefficients

KT Deepak, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org
In this paper, the speech signal recorded from the desired speaker close to microphone in
natural environment is regarded as foreground speech and rest of the interfering sources as …

A hybrid feature-extracted deep CNN with reduced parameters substitutes an End-to-End CNN for the recognition of spoken Bengali digits

B Paul, S Phadikar - Multimedia Tools and Applications, 2024 - Springer
Speech Recognition (SR) is an emerging field in the native language nowadays.
Recognizing isolated words in the local language helps people use smartphones and …

A novel pre-processing technique of amplitude interpolation for enhancing the classification accuracy of Bengali phonemes

B Paul, S Phadikar - Multimedia Tools and Applications, 2023 - Springer
In linguistics, phonemes are the atomic sound, called word segmentor play an important role
to recognize the word properly. A novel approach of seven Bengali vowels and ten …

A continuous differentiable wavelet threshold function for speech enhancement

H Jia, X Zhang, J Bai - Journal of Central South University, 2013 - Springer
Enhanced speech based on the traditional wavelet threshold function had auditory
oscillation distortion and the low signal-to-noise ratio (SNR). In order to solve these …

Enhancement of cleft palate speech using temporal and spectral processing

PN Sudro, SRM Prasanna - Speech Communication, 2020 - Elsevier
The speech of the individuals with cleft palate (CP) is generally characterized by the
presence of abnormal nasal resonances during the production of voiced sounds, primarily in …

We Care: Multimodal Depression Detection and Knowledge Infused Mental Health Therapeutic Response Generation

P Moon, P Bhattacharyya - arXiv preprint arXiv:2406.10561, 2024 - arxiv.org
The detection of depression through non-verbal cues has gained significant attention.
Previous research predominantly centred on identifying depression within the confines of …

Speech enhancement using source information for phoneme recognition of speech with background music

BK Khonglah, A Dey, SRM Prasanna - Circuits, Systems, and Signal …, 2019 - Springer
This work explores the significance of source information for speech enhancement resulting
in better phoneme recognition of speech with background music segments. Standard …