Adversarial data augmentation for disordered speech recognition

Z Jin, M Geng, X Xie, J Yu, S Liu, X Liu… - arXiv preprint arXiv …, 2021 - arxiv.org
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …

Adversarial data augmentation using VAE-GAN for disordered speech recognition

Z Jin, X Xie, M Geng, T Wang, S Hu… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …

Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation

H Wang, Z Jin, M Geng, S Hu, G Li… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Automatic recognition of dysarthric speech remains a highly challenging task to date.
Neuro-motor conditions and co-occurring physical disabilities create difficulty in large-scale data …

Personalized adversarial data augmentation for dysarthric and elderly speech recognition

Z Jin, M Geng, J Deng, T Wang, S Hu… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech, accurate recognition of dysarthric and elderly speech remains a highly …

Recent progress in the CUHK dysarthric speech recognition system

S Liu, M Geng, S Hu, X Xie, M Cui, J Yu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies in the past
few decades, recognition of disordered speech remains a highly challenging task to date …

Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition

S Hu, S Liu, X Xie, M Geng, T Wang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Articulatory features are inherently invariant to acoustic signal distortion and have been
successfully incorporated into automatic speech recognition (ASR) systems for normal …

Spectro-temporal deep features for disordered speech assessment and recognition

M Geng, S Liu, J Yu, X Xie, S Hu, Z Ye, Z Jin… - arXiv preprint arXiv …, 2022 - arxiv.org
Automatic recognition of disordered speech remains a highly challenging task to date.
Sources of variability commonly found in normal speech including accent, age or gender …

Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition

S Liu, S Hu, Y Wang, J Yu, R Su, X Liu, H Meng - INTERSPEECH, 2019 - isca-archive.org
Automatic speech recognition (ASR) for disordered speech is a challenging task. People
with speech disorders such as dysarthria often have physical disabilities, leading to severe …

Source domain data selection for improved transfer learning targeting dysarthric speech recognition

F Xiong, J Barker, Z Yue… - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
This paper presents an improved transfer learning framework applied to robust personalised
speech recognition models for speakers with dysarthria. As the baseline of transfer learning …

Investigation of data augmentation techniques for disordered speech recognition

M Geng, X Xie, S Liu, J Yu, S Hu, X Liu… - arXiv preprint arXiv …, 2022 - arxiv.org
Disordered speech recognition is a highly challenging task. The underlying neuro-motor
conditions of people with speech disorders, often compounded with co-occurring physical …