Adaptation of deep neural network acoustic models using factorised i-vectors.

AB Nassif, I Shahin, I Attili, M Azzeh, K Shaalan - IEEE access, 2019 - ieeexplore.ieee.org

Over the past decades, a tremendous amount of research has been done on the use of
machine learning for speech processing applications, especially speech recognition …

被引用次数：1185 相关文章所有 9 个版本

[PDF] arxiv.org

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org

Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

被引用次数：391 相关文章所有 10 个版本

[PDF] hal.science

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

E Vincent, S Watanabe, AA Nugraha, J Barker… - Computer Speech & …, 2017 - Elsevier

Speech enhancement and automatic speech recognition (ASR) are most often evaluated in
matched (or multi-condition) settings where the acoustic conditions of the training data …

被引用次数：410 相关文章所有 16 个版本

[PDF] arxiv.org

Transfer learning for speech and language processing

D Wang, TF Zheng - 2015 Asia-Pacific Signal and Information …, 2015 - ieeexplore.ieee.org

Transfer learning is a vital technique that generalizes models trained for one setting or task
to other settings or tasks. For example in speech recognition, an acoustic model trained for …

被引用次数：245 相关文章所有 12 个版本

[PDF] ieee.org

Adaptation algorithms for neural network-based speech recognition: An overview

P Bell, J Fainberg, O Klejch, J Li… - IEEE Open Journal …, 2020 - ieeexplore.ieee.org

We present a structured overview of adaptation algorithms for neural network-based speech
recognition, considering both hybrid hidden Markov model/neural network systems and end …

被引用次数：87 相关文章所有 7 个版本

[PDF] ed.ac.uk

Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models

P Swietojanski, S Renals - 2014 IEEE Spoken Language …, 2014 - ieeexplore.ieee.org

This paper proposes a simple yet effective model-based neural network speaker adaptation
technique that learns speaker-specific hidden unit contributions given adaptation data …

被引用次数：275 相关文章所有 8 个版本

[PDF] ed.ac.uk

A study of speaker adaptation for DNN-based speech synthesis

Z Wu, P Swietojanski, C Veaux, S Renals… - Interspeech 2015, 2015 - research.ed.ac.uk

A major advantage of statistical parametric speech synthesis (SPSS) over unit-selection
speech synthesis is its adaptability and controllability in changing speaker characteristics …

被引用次数：173 相关文章所有 14 个版本

[PDF] arxiv.org

Learning hidden unit contributions for unsupervised acoustic model adaptation

P Swietojanski, J Li, S Renals - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org

This work presents a broad study on the adaptation of neural network acoustic models by
means of learning hidden unit contributions (LHUC)-a method that linearly re-combines …

被引用次数：152 相关文章所有 6 个版本

[PDF] cmu.edu

Speaker adaptive training of deep neural network acoustic models using i-vectors

Y Miao, H Zhang, F Metze - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org

In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique
for the traditional Gaussian mixture models (GMMs). Acoustic models trained with SAT …

被引用次数：144 相关文章所有 8 个版本

An end-to-end deep learning approach to simultaneous speech dereverberation and acoustic modeling for robust speech recognition

B Wu, K Li, F Ge, Z Huang, M Yang… - IEEE Journal of …, 2017 - ieeexplore.ieee.org

We propose an integrated end-to-end automatic speech recognition (ASR) paradigm by joint
learning of the front-end speech signal processing and back-end acoustic modeling. We …

被引用次数：85 相关文章所有 4 个版本