Speech recognition using deep neural networks: A systematic review

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org
Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

E Vincent, S Watanabe, AA Nugraha, J Barker… - Computer Speech & …, 2017 - Elsevier
Speech enhancement and automatic speech recognition (ASR) are most often evaluated in
matched (or multi-condition) settings where the acoustic conditions of the training data …

Transfer learning for speech and language processing

D Wang, TF Zheng - 2015 Asia-Pacific Signal and Information …, 2015 - ieeexplore.ieee.org
Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …

Adaptation algorithms for neural network-based speech recognition: An overview

P Bell, J Fainberg, O Klejch, J Li… - IEEE Open Journal …, 2020 - ieeexplore.ieee.org
We present a structured overview of adaptation algorithms for neural network-based speech
recognition, considering both hybrid hidden Markov model/neural network systems and end …

Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models

P Swietojanski, S Renals - 2014 IEEE Spoken Language …, 2014 - ieeexplore.ieee.org
This paper proposes a simple yet effective model-based neural network speaker adaptation
technique that learns speaker-specific hidden unit contributions given adaptation data …

A study of speaker adaptation for DNN-based speech synthesis

Z Wu, P Swietojanski, C Veaux, S Renals… - Interspeech 2015, 2015 - research.ed.ac.uk
A major advantage of statistical parametric speech synthesis (SPSS) over unit-selection
speech synthesis is its adaptability and controllability in changing speaker characteristics …

Learning hidden unit contributions for unsupervised acoustic model adaptation

P Swietojanski, J Li, S Renals - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
This work presents a broad study on the adaptation of neural network acoustic models by
means of learning hidden unit contributions (LHUC)-a method that linearly re-combines …

Speaker adaptive training of deep neural network acoustic models using i-vectors

Y Miao, H Zhang, F Metze - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique
for the traditional Gaussian mixture models (GMMs). Acoustic models trained with SAT …

An end-to-end deep learning approach to simultaneous speech dereverberation and acoustic modeling for robust speech recognition

B Wu, K Li, F Ge, Z Huang, M Yang… - IEEE Journal of …, 2017 - ieeexplore.ieee.org
We propose an integrated end-to-end automatic speech recognition (ASR) paradigm by joint
learning of the front-end speech signal processing and back-end acoustic modeling. We …