Effect of pitch enhancement in Punjabi children's speech recognition system under disparate acoustic conditions
V Bhardwaj, V Kukreja - Applied Acoustics, 2021 - Elsevier
In this work, a Punjabi children speech recognition system is developed under different
acoustic matched and mismatched conditions. One major problem in children's speech …
acoustic matched and mismatched conditions. One major problem in children's speech …
Significance of vowel-like regions for speaker verification under degraded conditions
SRM Prasanna, G Pradhan - IEEE transactions on audio …, 2011 - ieeexplore.ieee.org
Vowel-like regions (VLRs) in speech includes vowels, semi-vowels, and diphthong sound
units. VLR can be identified using a vowel-like region onset point (VLROP) event. By …
units. VLR can be identified using a vowel-like region onset point (VLROP) event. By …
[图书][B] Speech processing in mobile environments
KS Rao, AK Vuppala - 2014 - Springer
Robust speech systems in mobile environment have gained a special interest in recent
years in order to enable access to remote voice-activated services. In this context, three …
years in order to enable access to remote voice-activated services. In this context, three …
Foreground speech segmentation and enhancement using glottal closure instants and mel cepstral coefficients
KT Deepak, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org
In this paper, the speech signal recorded from the desired speaker close to microphone in
natural environment is regarded as foreground speech and rest of the interfering sources as …
natural environment is regarded as foreground speech and rest of the interfering sources as …
A hybrid feature-extracted deep CNN with reduced parameters substitutes an End-to-End CNN for the recognition of spoken Bengali digits
B Paul, S Phadikar - Multimedia Tools and Applications, 2024 - Springer
Speech Recognition (SR) is an emerging field in the native language nowadays.
Recognizing isolated words in the local language helps people use smartphones and …
Recognizing isolated words in the local language helps people use smartphones and …
A novel pre-processing technique of amplitude interpolation for enhancing the classification accuracy of Bengali phonemes
B Paul, S Phadikar - Multimedia Tools and Applications, 2023 - Springer
In linguistics, phonemes are the atomic sound, called word segmentor play an important role
to recognize the word properly. A novel approach of seven Bengali vowels and ten …
to recognize the word properly. A novel approach of seven Bengali vowels and ten …
A continuous differentiable wavelet threshold function for speech enhancement
H Jia, X Zhang, J Bai - Journal of Central South University, 2013 - Springer
Enhanced speech based on the traditional wavelet threshold function had auditory
oscillation distortion and the low signal-to-noise ratio (SNR). In order to solve these …
oscillation distortion and the low signal-to-noise ratio (SNR). In order to solve these …
Enhancement of cleft palate speech using temporal and spectral processing
PN Sudro, SRM Prasanna - Speech Communication, 2020 - Elsevier
The speech of the individuals with cleft palate (CP) is generally characterized by the
presence of abnormal nasal resonances during the production of voiced sounds, primarily in …
presence of abnormal nasal resonances during the production of voiced sounds, primarily in …
We Care: Multimodal Depression Detection and Knowledge Infused Mental Health Therapeutic Response Generation
P Moon, P Bhattacharyya - arXiv preprint arXiv:2406.10561, 2024 - arxiv.org
The detection of depression through non-verbal cues has gained significant attention.
Previous research predominantly centred on identifying depression within the confines of …
Previous research predominantly centred on identifying depression within the confines of …
Speech enhancement using source information for phoneme recognition of speech with background music
This work explores the significance of source information for speech enhancement resulting
in better phoneme recognition of speech with background music segments. Standard …
in better phoneme recognition of speech with background music segments. Standard …