Adversarial data augmentation using vae-gan for disordered speech recognition

Z Jin, X Xie, M Geng, T Wang, S Hu… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …

Adversarial data augmentation for disordered speech recognition

Z Jin, M Geng, X Xie, J Yu, S Liu, X Liu… - arXiv preprint arXiv …, 2021 - arxiv.org
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …

Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation

H Wang, Z Jin, M Geng, S Hu, G Li… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Automatic recognition of dysarthric speech remains a highly challenging task to date. Neuro-
motor conditions and co-occurring physical disabilities create difficulty in large-scale data …

Personalized adversarial data augmentation for dysarthric and elderly speech recognition

Z Jin, M Geng, J Deng, T Wang, S Hu… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech, accurate recognition of dysarthric and elderly speech remains a highly …

Recent progress in the CUHK dysarthric speech recognition system

S Liu, M Geng, S Hu, X Xie, M Cui, J Yu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies in the past
few decades, recognition of disordered speech remains a highly challenging task to date …

Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition

S Hu, S Liu, X Xie, M Geng, T Wang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Articulatory features are inherently invariant to acoustic signal distortion and have been
successfully incorporated into automatic speech recognition (ASR) systems for normal …

Use of speech impairment severity for dysarthric speech recognition

M Geng, Z Jin, T Wang, S Hu, J Deng, M Cui… - arXiv preprint arXiv …, 2023 - arxiv.org
A key challenge in dysarthric speech recognition is the speaker-level diversity attributed to
both speaker-identity associated factors such as gender, and speech impairment severity …

[PDF][PDF] Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.

S Liu, S Hu, Y Wang, J Yu, R Su, X Liu, H Meng - INTERSPEECH, 2019 - isca-archive.org
Automatic speech recognition (ASR) for disordered speech is a challenging task. People
with speech disorders such as dysarthria often have physical disabilities, leading to severe …

Data augmentation using conditional generative adversarial networks for robust speech recognition

P Sheng, Z Yang, H Hu, T Tan… - 2018 11th international …, 2018 - ieeexplore.ieee.org
For noise robust speech recognition, data mismatch between training and test is a significant
challenge. To reduce this mismatch, traditional approach of data augmentation usually adds …

[PDF][PDF] Adversarial Feature-Mapping for Speech Enhancement.

Z Meng, J Li, Y Gong, BHF Juang - Interspeech, 2018 - isca-archive.org
Feature-mapping with deep neural networks is commonly used for single-channel speech
enhancement, in which a feature-mapping network directly transforms the noisy features to …