Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation

H Wang, Z Jin, M Geng, S Hu, G Li… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Automatic recognition of dysarthric speech remains a highly challenging task to date. Neuro-
motor conditions and co-occurring physical disabilities create difficulty in large-scale data …

Improving the efficiency of dysarthria voice conversion system based on data augmentation

WZ Zheng, JY Han, CY Chen… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Dysarthria, a speech disorder often caused by neurological damage, compromises the
control of vocal muscles in patients, making their speech unclear and communication …

Use of speech impairment severity for dysarthric speech recognition

M Geng, Z Jin, T Wang, S Hu, J Deng, M Cui… - arXiv preprint arXiv …, 2023 - arxiv.org
A key challenge in dysarthric speech recognition is the speaker-level diversity attributed to
both speaker-identity associated factors such as gender, and speech impairment severity …

Uncovering the Potential for a Weakly Supervised End-to-End Model in Recognising Speech from Patient with Post-Stroke Aphasia

G Sanguedolce, PA Naylor… - Proceedings of the 5th …, 2023 - aclanthology.org
Post-stroke speech and language deficits (aphasia) significantly impact patients' quality of
life. Many with mild symptoms remain undiagnosed, and the majority do not receive the …

Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition

S Hu, X Xie, M Geng, Z Jin, J Deng, G Li… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Self-supervised learning (SSL) based speech foundation models have been applied to a
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …

Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading

S Luo, S Yang, S Shan, X Chen - arXiv preprint arXiv:2310.05058, 2023 - arxiv.org
In this paper, we propose a novel method for speaker adaptation in lip reading, motivated by
two observations. Firstly, a speaker's own characteristics can always be portrayed well by …

Towards Automatic Data Augmentation for Disordered Speech Recognition

Z Jin, X Xie, T Wang, M Geng, J Deng… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Automatic recognition of disordered speech remains a highly challenging task to date due to
data scarcity. This paper presents a reinforcement learning (RL) based on-the-fly data …

Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation

M Geng, X Xie, J Deng, Z Jin, G Li, T Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
The application of data-intensive automatic speech recognition (ASR) technologies to
dysarthric and elderly adult speech is confronted by their mismatch against healthy and …

Enhancing Terahertz Spectral Recognition of Lung Cancer Cells Through Synthetic Signal Generation

J Zheng, C Jia, M Zhao, F Shi, P Yu… - IEEE Sensors …, 2024 - ieeexplore.ieee.org
Deep learning-based medical diagnosis models heavily rely on large-scale datasets
encompassing diverse aspects of a patient's condition. However, the scarcity of such …

Debiased automatic speech recognition for dysarthric speech via sample reweighting with sample affinity test

E Kim, Y Chae, J Sim, K Lee - arXiv preprint arXiv:2305.13108, 2023 - arxiv.org
Automatic speech recognition systems based on deep learning are mainly trained under
empirical risk minimization (ERM). Since ERM utilizes the averaged performance on the …