Enhancement of noisy speech by temporal and spectral processing

Effect of pitch enhancement in Punjabi children's speech recognition system under disparate acoustic conditions

V Bhardwaj, V Kukreja - Applied Acoustics, 2021 - Elsevier

In this work, a Punjabi children speech recognition system is developed under different
acoustic matched and mismatched conditions. One major problem in children's speech …

被引用次数：73 相关文章

Significance of vowel-like regions for speaker verification under degraded conditions

SRM Prasanna, G Pradhan - IEEE transactions on audio …, 2011 - ieeexplore.ieee.org

Vowel-like regions (VLRs) in speech includes vowels, semi-vowels, and diphthong sound
units. VLR can be identified using a vowel-like region onset point (VLROP) event. By …

被引用次数：91 相关文章所有 4 个版本

[图书][B] Speech processing in mobile environments

KS Rao, AK Vuppala - 2014 - Springer

Robust speech systems in mobile environment have gained a special interest in recent
years in order to enable access to remote voice-activated services. In this context, three …

被引用次数：38 相关文章所有 6 个版本

Foreground speech segmentation and enhancement using glottal closure instants and mel cepstral coefficients

KT Deepak, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org

In this paper, the speech signal recorded from the desired speaker close to microphone in
natural environment is regarded as foreground speech and rest of the interfering sources as …

被引用次数：26 相关文章所有 3 个版本

A hybrid feature-extracted deep CNN with reduced parameters substitutes an End-to-End CNN for the recognition of spoken Bengali digits

B Paul, S Phadikar - Multimedia Tools and Applications, 2024 - Springer

Speech Recognition (SR) is an emerging field in the native language nowadays.
Recognizing isolated words in the local language helps people use smartphones and …

被引用次数：4 相关文章所有 3 个版本

[PDF] researchgate.net

A novel pre-processing technique of amplitude interpolation for enhancing the classification accuracy of Bengali phonemes

B Paul, S Phadikar - Multimedia Tools and Applications, 2023 - Springer

In linguistics, phonemes are the atomic sound, called word segmentor play an important role
to recognize the word properly. A novel approach of seven Bengali vowels and ten …

被引用次数：4 相关文章所有 5 个版本

A continuous differentiable wavelet threshold function for speech enhancement

H Jia, X Zhang, J Bai - Journal of Central South University, 2013 - Springer

Enhanced speech based on the traditional wavelet threshold function had auditory
oscillation distortion and the low signal-to-noise ratio (SNR). In order to solve these …

被引用次数：22 相关文章所有 4 个版本

Enhancement of cleft palate speech using temporal and spectral processing

PN Sudro, SRM Prasanna - Speech Communication, 2020 - Elsevier

The speech of the individuals with cleft palate (CP) is generally characterized by the
presence of abnormal nasal resonances during the production of voiced sounds, primarily in …

被引用次数：7 相关文章所有 2 个版本

[PDF] arxiv.org

We Care: Multimodal Depression Detection and Knowledge Infused Mental Health Therapeutic Response Generation

P Moon, P Bhattacharyya - arXiv preprint arXiv:2406.10561, 2024 - arxiv.org

The detection of depression through non-verbal cues has gained significant attention.
Previous research predominantly centred on identifying depression within the confines of …

被引用次数：3 相关文章

Speech enhancement using source information for phoneme recognition of speech with background music

BK Khonglah, A Dey, SRM Prasanna - Circuits, Systems, and Signal …, 2019 - Springer

This work explores the significance of source information for speech enhancement resulting
in better phoneme recognition of speech with background music segments. Standard …

被引用次数：11 相关文章所有 4 个版本