Spectro-temporal deep features for disordered speech assessment and recognition

M Geng, S Liu, J Yu, X Xie, S Hu, Z Ye, Z Jin… - arXiv preprint arXiv …, 2022 - arxiv.org
Automatic recognition of disordered speech remains a highly challenging task to date.
Sources of variability commonly found in normal speech including accent, age or gender …

Speaker adaptation using spectro-temporal deep features for dysarthric and elderly speech recognition

M Geng, X Xie, Z Ye, T Wang, G Li, S Hu… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech in recent decades, accurate recognition of dysarthric and elderly speech …

Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition

S Hu, S Liu, X Xie, M Geng, T Wang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Articulatory features are inherently invariant to acoustic signal distortion and have been
successfully incorporated into automatic speech recognition (ASR) systems for normal …

Use of speech impairment severity for dysarthric speech recognition

M Geng, Z Jin, T Wang, S Hu, J Deng, M Cui… - arXiv preprint arXiv …, 2023 - arxiv.org
A key challenge in dysarthric speech recognition is the speaker-level diversity attributed to
both speaker-identity associated factors such as gender, and speech impairment severity …

Recent progress in the CUHK dysarthric speech recognition system

S Liu, M Geng, S Hu, X Xie, M Cui, J Yu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies in the past
few decades, recognition of disordered speech remains a highly challenging task to date …

Speaker adaptation for Wav2vec2 based dysarthric ASR

MK Baskar, T Herzig, D Nguyen, M Diez… - arXiv preprint arXiv …, 2022 - arxiv.org
Dysarthric speech recognition has posed major challenges due to lack of training data and
heavy mismatch in speaker characteristics. Recent ASR systems have benefited from readily …

Adversarial data augmentation for disordered speech recognition

Z Jin, M Geng, X Xie, J Yu, S Liu, X Liu… - arXiv preprint arXiv …, 2021 - arxiv.org
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …

Personalized adversarial data augmentation for dysarthric and elderly speech recognition

Z Jin, M Geng, J Deng, T Wang, S Hu… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech, accurate recognition of dysarthric and elderly speech remains a highly …

Enhancing pre-trained ASR system fine-tuning for dysarthric speech recognition using adversarial data augmentation

H Wang, Z Jin, M Geng, S Hu, G Li… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Automatic recognition of dysarthric speech remains a highly challenging task to date.
Neuro-motor conditions and co-occurring physical disabilities create difficulty in large-scale data …

Investigation of data augmentation techniques for disordered speech recognition

M Geng, X Xie, S Liu, J Yu, S Hu, X Liu… - arXiv preprint arXiv …, 2022 - arxiv.org
Disordered speech recognition is a highly challenging task. The underlying neuro-motor
conditions of people with speech disorders, often compounded with co-occurring physical …