SMACK: Semantically Meaningful Adversarial Audio Attack

Z Yu, Y Chang, N Zhang, C Xiao - 32nd USENIX Security Symposium …, 2023 - usenix.org
Voice controllable systems rely on speech recognition and speaker identification as the key
enabling technologies. While they bring revolutionary changes to our daily lives, their …

Synthesizing dysarthric speech using multi-speaker TTS for dysarthric speech recognition

M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

Audio augmentation for non-native children's speech recognition through discriminative learning

K Radha, M Bansal - Entropy, 2022 - mdpi.com
Automatic speech recognition (ASR) in children is a rapidly evolving field, as children
become more accustomed to interacting with virtual assistants, such as Amazon Echo …

Creating robust children's ASR system in zero-resource condition through out-of-domain data augmentation

V Kumar, A Kumar, S Shahnawazuddin - Circuits, Systems, and Signal …, 2022 - Springer
Developing an automatic speech recognition (ASR) system for children's speech is
extremely challenging due to the unavailability of data from the child domain for the majority …

Accurate synthesis of dysarthric speech for ASR data augmentation

M Soleymanpour, MT Johnson, R Soleymanpour… - Speech …, 2024 - Elsevier
Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

Spectral warping and data augmentation for low resource language ASR system under mismatched conditions

M Dua, V Kadyan, N Banthia, A Bansal, T Agarwal - Applied Acoustics, 2022 - Elsevier
The performance of an Automatic Speech Recognition System (ASR) system deteriorates
while using it on children speech, due to large variations and mismatch of acoustic and …

Data augmentation using spectral warping for low resource children ASR

HK Kathania, V Kadyan, SR Kadiri… - Journal of Signal …, 2022 - Springer
In low resource children automatic speech recognition (ASR) the performance is degraded
due to limited acoustic and speaker variability available in small datasets. In this paper, we …

Synthesis speech based data augmentation for low resource children ASR

V Kadyan, H Kathania, P Govil, M Kurimo - Speech and Computer: 23rd …, 2021 - Springer
Successful speech recognition for children requires large training data with sufficient
speaker variability. The collection of such a training database of children's voices is …

Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System

J Wang, Y Zhu, R Fan, W Chu, A Alwan - arXiv preprint arXiv:2106.09963, 2021 - arxiv.org
This paper describes the SPAPL system for the INTERSPEECH 2021 Challenge: Shared
Task on Automatic Speech Recognition for Non-Native Children's Speech in German. ~5 …

Spectral warping based data augmentation for low resource children's speaker verification

HK Kathania, V Kadyan, SR Kadiri… - Multimedia Tools and …, 2024 - Springer
In this paper, we present our effort to develop an automatic speaker verification (ASV)
system for low resources children's data. For the children's speakers, very limited amount of …