[PDF][PDF] Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition.
Automatic recognition of elderly and disordered speech remains a highly challenging task to
date. Such data is not only difficult to collect in large quantities, but also exhibits a significant …
date. Such data is not only difficult to collect in large quantities, but also exhibits a significant …
Bayesian learning of LF-MMI trained time delay neural networks for speech recognition
Discriminative training techniques define state-of-the-art performance for automatic speech
recognition systems. However, they are inherently prone to overfitting, leading to poor …
recognition systems. However, they are inherently prone to overfitting, leading to poor …
Domain adaptation of lattice-free MMI based TDNN models for speech recognition
Y Long, Y Li, H Ye, H Mao - International Journal of Speech Technology, 2017 - Springer
The recent proposed time-delay deep neural network (TDNN) acoustic models trained with
lattice-free maximum mutual information (LF-MMI) criterion have been shown to give …
lattice-free maximum mutual information (LF-MMI) criterion have been shown to give …
Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition
Self-supervised learning (SSL) based speech foundation models have been applied to a
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …
Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus
Early diagnosis of Neurocognitive Disorder (NCD) is crucial in facilitating preventive care
and timely treatment to delay further progression. This paper presents the development of a …
and timely treatment to delay further progression. This paper presents the development of a …
Exploring self-supervised pre-trained asr models for dysarthric and elderly speech recognition
Automatic recognition of disordered and elderly speech remains a highly challenging task to
date due to the difficulty in collecting such data in large quantities. This paper explores a …
date due to the difficulty in collecting such data in large quantities. This paper explores a …
Neural architecture search for LF-MMI trained time delay neural networks
State-of-the-art automatic speech recognition (ASR) system development is data and
computation intensive. The optimal design of deep neural networks (DNNs) for these …
computation intensive. The optimal design of deep neural networks (DNNs) for these …
Speaker adaptation using spectro-temporal deep features for dysarthric and elderly speech recognition
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech in recent decades, accurate recognition of dysarthric and elderly speech …
normal speech in recent decades, accurate recognition of dysarthric and elderly speech …
Bayesian transformer language models for speech recognition
State-of-the-art neural language models (LMs) represented by Transformers are highly
complex. Their use of fixed, deterministic parameter estimates fail to account for model …
complex. Their use of fixed, deterministic parameter estimates fail to account for model …
[PDF][PDF] Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features.
Speaker adaptation techniques play a key role in reducing the mismatch between automatic
speech recognition (ASR) systems and target users. Deep neural network (DNN) acoustic …
speech recognition (ASR) systems and target users. Deep neural network (DNN) acoustic …