Cross-lingual self-supervised speech representations for improved dysarthric speech recognition

A Hernandez, PA Pérez-Toro, E Nöth… - arXiv preprint arXiv …, 2022 - arxiv.org
State-of-the-art automatic speech recognition (ASR) systems perform well on healthy
speech. However, the performance on impaired speech still remains an issue. The current …

Synthesis of new words for improved dysarthric speech recognition on an expanded vocabulary

J Harvill, D Issa… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Dysarthria is a condition where people experience a reduction in speech intelligibility due to
a neuromotor disorder. Previous works in dysarthric speech recognition have focused on …

Phonetic posteriorgram-based voice conversion system to improve speech intelligibility of dysarthric patients

WZ Zheng, JY Han, CK Lee, YY Lin, SH Chang… - Computer Methods and …, 2022 - Elsevier
Background and Objective: Most dysarthric patients encounter communication
problems due to unintelligible speech. Currently, there are many voice-driven systems …

Extending parrotron: An end-to-end, speech conversion and speech recognition model for atypical speech

R Doshi, Y Chen, L Jiang, X Zhang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
We present an extended Parrotron model: a single, end-to-end network that enables voice
conversion and recognition simultaneously. Input spectrograms are transformed to output …

A speech command control-based recognition system for dysarthric patients based on deep learning technology

YY Lin, WZ Zheng, WC Chu, JY Han, YH Hung… - Applied Sciences, 2021 - mdpi.com
Voice control is an important way of controlling mobile devices; however, using it remains a
challenge for dysarthric patients. Currently, there are many approaches, such as automatic …

Personalized adversarial data augmentation for dysarthric and elderly speech recognition

Z Jin, M Geng, J Deng, T Wang, S Hu… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech, accurate recognition of dysarthric and elderly speech remains a highly …

Improving the efficiency of dysarthria voice conversion system based on data augmentation

WZ Zheng, JY Han, CY Chen… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Dysarthria, a speech disorder often caused by neurological damage, compromises the
control of vocal muscles in patients, making their speech unclear and communication …

The effect of speech pathology on automatic speaker verification: a large-scale study

S Tayebi Arasteh, T Weise, M Schuster, E Noeth… - Scientific Reports, 2023 - nature.com
Navigating the challenges of data-driven speech processing, one of the primary hurdles is
accessing reliable pathological speech data. While public datasets appear to offer solutions …

Deep Learning for Pathological Speech: A Survey

SA Sheikh, M Sahidullah, I Kodrasi - arXiv preprint arXiv:2501.03536, 2025 - arxiv.org
Advancements in spoken language technologies for neurodegenerative speech disorders
are crucial for meeting both clinical and technological needs. This overview paper is vital for …

High-intelligibility speech synthesis for dysarthric speakers with LPCNet-based TTS and CycleVAE-based VC

K Matsubara, T Okamoto, R Takashima… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
This paper presents a high-intelligibility speech synthesis method for persons with dysarthria
caused by athetoid cerebral palsy. The muscular control of such speakers is unstable …