Data augmentation using prosody and false starts to recognize non-native children's speech

Z Yu, Y Chang, N Zhang, C Xiao - 32nd USENIX Security Symposium …, 2023 - usenix.org

Voice controllable systems rely on speech recognition and speaker identification as the key
enabling technologies. While they bring revolutionary changes to our daily lives, their …

被引用次数：18 相关文章所有 4 个版本

[PDF] uky.edu

Synthesizing dysarthric speech using multi-speaker tts for dysarthric speech recognition

M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

被引用次数：28 相关文章所有 4 个版本

[PDF] mdpi.com

Audio augmentation for non-native children's speech recognition through discriminative learning

K Radha, M Bansal - Entropy, 2022 - mdpi.com

Automatic speech recognition (ASR) in children is a rapidly evolving field, as children
become more accustomed to interacting with virtual assistants, such as Amazon Echo …

被引用次数：19 相关文章所有 8 个版本

Creating robust children's ASR system in zero-resource condition through out-of-domain data augmentation

V Kumar, A Kumar, S Shahnawazuddin - Circuits, Systems, and Signal …, 2022 - Springer

Developing an automatic speech recognition (ASR) system for children's speech is
extremely challenging due to the unavailability of data from the child domain for the majority …

被引用次数：17 相关文章所有 4 个版本

[PDF] arxiv.org

Accurate synthesis of dysarthric speech for asr data augmentation

M Soleymanpour, MT Johnson, R Soleymanpour… - Speech …, 2024 - Elsevier

Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

被引用次数：4 相关文章所有 3 个版本

Spectral warping and data augmentation for low resource language ASR system under mismatched conditions

M Dua, V Kadyan, N Banthia, A Bansal, T Agarwal - Applied Acoustics, 2022 - Elsevier

The performance of an Automatic Speech Recognition System (ASR) system deteriorates
while using it on children speech, due to large variations and mismatch of acoustic and …

被引用次数：11 相关文章所有 2 个版本

[PDF] springer.com

Data augmentation using spectral warping for low resource children asr

HK Kathania, V Kadyan, SR Kadiri… - Journal of Signal …, 2022 - Springer

In low resource children automatic speech recognition (ASR) the performance is degraded
due to limited acoustic and speaker variability available in small datasets. In this paper, we …

被引用次数：9 相关文章所有 9 个版本

[PDF] aalto.fi

Synthesis speech based data augmentation for low resource children ASR

V Kadyan, H Kathania, P Govil, M Kurimo - Speech and Computer: 23rd …, 2021 - Springer

Successful speech recognition for children requires large training data with sufficient
speaker variability. The collection of such a training database of children's voices is …

被引用次数：11 相关文章所有 6 个版本

[PDF] arxiv.org

Low Resource German ASR with Untranscribed Data Spoken by Non-native Children--INTERSPEECH 2021 Shared Task SPAPL System

J Wang, Y Zhu, R Fan, W Chu, A Alwan - arXiv preprint arXiv:2106.09963, 2021 - arxiv.org

This paper describes the SPAPL system for the INTERSPEECH 2021 Challenge: Shared
Task on Automatic Speech Recognition for Non-Native Children's Speech in German.~ 5 …

被引用次数：12 相关文章所有 8 个版本

[PDF] springer.com

Spectral warping based data augmentation for low resource children's speaker verification

HK Kathania, V Kadyan, SR Kadiri… - Multimedia Tools and …, 2024 - Springer

In this paper, we present our effort to develop an automatic speaker verification (ASV)
system for low resources children's data. For the children's speakers, very limited amount of …

被引用次数：1 相关文章所有 6 个版本