{SMACK}: Semantically Meaningful Adversarial Audio Attack
Voice controllable systems rely on speech recognition and speaker identification as the key
enabling technologies. While they bring revolutionary changes to our daily lives, their …
enabling technologies. While they bring revolutionary changes to our daily lives, their …
Synthesizing dysarthric speech using multi-speaker tts for dysarthric speech recognition
M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …
through slow, uncoordinated control of speech production muscles. Automatic Speech …
Audio augmentation for non-native children's speech recognition through discriminative learning
Automatic speech recognition (ASR) in children is a rapidly evolving field, as children
become more accustomed to interacting with virtual assistants, such as Amazon Echo …
become more accustomed to interacting with virtual assistants, such as Amazon Echo …
Creating robust children's ASR system in zero-resource condition through out-of-domain data augmentation
Developing an automatic speech recognition (ASR) system for children's speech is
extremely challenging due to the unavailability of data from the child domain for the majority …
extremely challenging due to the unavailability of data from the child domain for the majority …
Accurate synthesis of dysarthric speech for asr data augmentation
Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …
through slow, uncoordinated control of speech production muscles. Automatic Speech …
Spectral warping and data augmentation for low resource language ASR system under mismatched conditions
The performance of an Automatic Speech Recognition System (ASR) system deteriorates
while using it on children speech, due to large variations and mismatch of acoustic and …
while using it on children speech, due to large variations and mismatch of acoustic and …
Data augmentation using spectral warping for low resource children asr
In low resource children automatic speech recognition (ASR) the performance is degraded
due to limited acoustic and speaker variability available in small datasets. In this paper, we …
due to limited acoustic and speaker variability available in small datasets. In this paper, we …
Synthesis speech based data augmentation for low resource children ASR
Successful speech recognition for children requires large training data with sufficient
speaker variability. The collection of such a training database of children's voices is …
speaker variability. The collection of such a training database of children's voices is …
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children--INTERSPEECH 2021 Shared Task SPAPL System
This paper describes the SPAPL system for the INTERSPEECH 2021 Challenge: Shared
Task on Automatic Speech Recognition for Non-Native Children's Speech in German.~ 5 …
Task on Automatic Speech Recognition for Non-Native Children's Speech in German.~ 5 …
Spectral warping based data augmentation for low resource children's speaker verification
In this paper, we present our effort to develop an automatic speaker verification (ASV)
system for low resources children's data. For the children's speakers, very limited amount of …
system for low resources children's data. For the children's speakers, very limited amount of …