Towards Automatic Data Augmentation for Disordered Speech Recognition

Z Jin, X Xie, T Wang, M Geng, J Deng… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Automatic recognition of disordered speech remains a highly challenging task to date due to
data scarcity. This paper presents a reinforcement learning (RL) based on-the-fly data …

Recent progress in the CUHK dysarthric speech recognition system

S Liu, M Geng, S Hu, X Xie, M Cui, J Yu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies in the past
few decades, recognition of disordered speech remains a highly challenging task to date …

Adversarial data augmentation for disordered speech recognition

Z Jin, M Geng, X Xie, J Yu, S Liu, X Liu… - arXiv preprint arXiv …, 2021 - arxiv.org
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …

Adversarial data augmentation using vae-gan for disordered speech recognition

Z Jin, X Xie, M Geng, T Wang, S Hu… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …

Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition

S Hu, S Liu, X Xie, M Geng, T Wang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Articulatory features are inherently invariant to acoustic signal distortion and have been
successfully incorporated into automatic speech recognition (ASR) systems for normal …

Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition

T Wang, S Hu, J Deng, Z Jin, M Geng, Y Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Automatic recognition of disordered and elderly speech remains highly challenging tasks to
date due to data scarcity. Parameter fine-tuning is often used to exploit the large quantities of …

Spectro-temporal deep features for disordered speech assessment and recognition

M Geng, S Liu, J Yu, X Xie, S Hu, Z Ye, Z Jin… - arXiv preprint arXiv …, 2022 - arxiv.org
Automatic recognition of disordered speech remains a highly challenging task to date.
Sources of variability commonly found in normal speech including accent, age or gender …

Exploring self-supervised pre-trained asr models for dysarthric and elderly speech recognition

S Hu, X Xie, Z Jin, M Geng, Y Wang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Automatic recognition of disordered and elderly speech remains a highly challenging task to
date due to the difficulty in collecting such data in large quantities. This paper explores a …

[PDF][PDF] Improved ASR Performance for Dysarthric Speech Using Two-stage DataAugmentation.

C Bhat, A Panda, H Strik - INTERSPEECH, 2022 - isca-archive.org
Abstract Machine learning (ML) and Deep Neural Networks (DNN) have greatly aided the
problem of Automatic Speech Recognition (ASR). However, accurate ASR for dysarthric …

Disordered speech recognition considering low resources and abnormal articulation

Y Lin, L Wang, J Dang, S Li, C Ding - Speech Communication, 2023 - Elsevier
The success of automatic speech recognition (ASR) benefits a great number of healthy
people, but not people with disorders. The speech disordered may truly need support from …