A survey of automatic speech recognition for dysarthric speech

Z Qian, K Xiao - Electronics, 2023 - mdpi.com
Dysarthric speech has several pathological characteristics, such as discontinuous
pronunciation, uncontrolled volume, slow speech, explosive pronunciation, improper …

Speaker adaptation using spectro-temporal deep features for dysarthric and elderly speech recognition

M Geng, X Xie, Z Ye, T Wang, G Li, S Hu… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech in recent decades, accurate recognition of dysarthric and elderly speech …

Disordered speech recognition considering low resources and abnormal articulation

Y Lin, J Dang, L Wang, S Li, C Ding - Speech Communication, 2023 - Elsevier
The success of automatic speech recognition (ASR) benefits a great number of healthy
people, but not people with disorders. The speech disordered may truly need support from …

Personalized adversarial data augmentation for dysarthric and elderly speech recognition

Z Jin, M Geng, J Deng, T Wang, S Hu… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech, accurate recognition of dysarthric and elderly speech remains a highly …

[HTML][HTML] Recent advancements in automatic disordered speech recognition: A survey paper

N Gohider, OA Basir - Natural Language Processing Journal, 2024 - Elsevier
Abstract Automatic Speech Recognition technology (ASR) has recently witnessed a
paradigm shift with respect to performance accuracy. Nevertheless, impaired speech …

Variational auto-encoder based variability encoding for dysarthric speech recognition

X Xie, R Ruzi, X Liu, L Wang - arXiv preprint arXiv:2201.09422, 2022 - arxiv.org
Dysarthric speech recognition is a challenging task due to acoustic variability and limited
amount of available data. Diverse conditions of dysarthric speakers account for the acoustic …

[PDF][PDF] Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition.

J Deng, FR Gutierrez, S Hu, M Geng, X Xie, Z Ye… - Interspeech, 2021 - se.cuhk.edu.hk
Automatic recognition of elderly and disordered speech remains a highly challenging task to
date. Such data is not only difficult to collect in large quantities, but also exhibits a significant …

RS-MSConvNet: A novel end-to-end pathological voice detection model

W Pathonsuwan, K Phapatanaburi, P Buayai… - IEEE …, 2022 - ieeexplore.ieee.org
Recent studies have reported the success of multi-scale convolution neural network
(MSConvNet) model for many classification applications due to its powerful ability of …

Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition

S Hu, X Xie, M Geng, Z Jin, J Deng, G Li… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Self-supervised learning (SSL) based speech foundation models have been applied to a
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …

CFDRN: A Cognition-Inspired Feature Decomposition and Recombination Network for Dysarthric Speech Recognition

Y Lin, L Wang, Y Yang, J Dang - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
As an essential technology in human–computer interactions, automatic speech recognition
(ASR) ensures a convenient life for healthy people; however, people with speech disorders …