Effect of prosody modification on children's ASR

S Shahnawazuddin, N Adiga, HK Kathania… - Pattern Recognition …, 2020 - Elsevier

In this paper, the effect of prosody-modification-based data augmentation is explored in the
context of automatic speech recognition (ASR). The primary motive is to develop ASR …

被引用次数：54 相关文章所有 3 个版本

[HTML] sciencedirect.com

[HTML][HTML] A formant modification method for improved ASR of children's speech

HK Kathania, SR Kadiri, P Alku, M Kurimo - Speech Communication, 2022 - Elsevier

Differences in acoustic characteristics between children's and adults' speech degrade
performance of automatic speech recognition systems when systems trained using adults' …

被引用次数：25 相关文章所有 7 个版本

[PDF] ieee.org

A text-to-speech pipeline, evaluation methodology, and initial fine-tuning results for child speech synthesis

R Jain, MY Yiwere, D Bigioi, P Corcoran… - IEEE Access, 2022 - ieeexplore.ieee.org

Speech synthesis has come a long way as current text-to-speech (TTS) models can now
generate natural human-sounding speech. However, most of the TTS research focuses on …

被引用次数：18 相关文章所有 4 个版本

Addressing noise and pitch sensitivity of speech recognition system through variational mode decomposition based spectral smoothing

IC Yadav, S Shahnawazuddin, G Pradhan - Digital Signal Processing, 2019 - Elsevier

In this paper, we propose a novel front-end speech parameterization technique for automatic
speech recognition (ASR) that is less sensitive towards ambient noise and pitch variations …

被引用次数：43 相关文章所有 2 个版本

In-domain and out-of-domain data augmentation to improve children's speaker verification system in limited data scenario

S Shahnawazuddin, W Ahmad… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

In this paper, we present our efforts towards developing a robust automatic speaker
verification (ASV) system for children when the domain-specific data is limited. For that …

被引用次数：30 相关文章

Creating robust children's ASR system in zero-resource condition through out-of-domain data augmentation

V Kumar, A Kumar, S Shahnawazuddin - Circuits, Systems, and Signal …, 2022 - Springer

Developing an automatic speech recognition (ASR) system for children's speech is
extremely challenging due to the unavailability of data from the child domain for the majority …

被引用次数：17 相关文章所有 4 个版本

Spectral warping and data augmentation for low resource language ASR system under mismatched conditions

M Dua, V Kadyan, N Banthia, A Bansal, T Agarwal - Applied Acoustics, 2022 - Elsevier

The performance of an Automatic Speech Recognition System (ASR) system deteriorates
while using it on children speech, due to large variations and mismatch of acoustic and …

被引用次数：11 相关文章所有 2 个版本

Children's speaker verification in low and zero resource conditions

S Shahnawazuddin, W Ahmad, N Adiga… - Digital Signal …, 2021 - Elsevier

Our efforts towards developing an automatic speaker verification (ASV) system for child
speakers are presented in this paper. For the majority of the languages, children's speech …

被引用次数：16 相关文章所有 3 个版本

Developing children's ASR system under low-resource conditions using end-to-end architecture

S Shahnawazuddin - Digital Signal Processing, 2024 - Elsevier

The work presented in this paper aims at enhancing the performance of end-to-end (E2E)
speech recognition task for children's speech under low resource conditions. For majority of …

被引用次数：5 相关文章所有 2 个版本

[PDF] academia.edu

Developing speaker independent ASR system using limited data through prosody modification based on fuzzy classification of spectral bins

S Shahnawazuddin, N Adiga, BT Sai, W Ahmad… - Digital Signal …, 2019 - Elsevier

The primary motive of this study is to develop an automatic speech recognition (ASR) system
using limited amount of speech data such that it is least affected by speaker-dependent …

被引用次数：21 相关文章所有 4 个版本