Machine learning paradigms for speech recognition: An overview

L Deng, X Li - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) has historically been a driving force behind many
machine learning (ML) techniques, including the ubiquitously used hidden Markov model …

Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition

GE Dahl, D Yu, L Deng, A Acero - IEEE Transactions on audio …, 2011 - ieeexplore.ieee.org
We propose a novel context-dependent (CD) model for large-vocabulary speech recognition
(LVSR) that leverages recent advances in using deep belief networks for phone recognition …

The application of hidden Markov models in speech recognition

M Gales, S Young - Foundations and Trends® in Signal …, 2008 - nowpublishers.com
The Application of Hidden Markov Models in Speech Recognition Page 1 The Application of
Hidden Markov Models in Speech Recognition Full text available at: http://dx.doi.org/10.1561/2000000004 …

Discriminative training of HMMs for automatic speech recognition: A survey

H Jiang - Computer Speech & Language, 2010 - Elsevier
Recently, discriminative training (DT) methods have achieved tremendous progress in
automatic speech recognition (ASR). In this survey article, all mainstream DT methods in …

Confusion-matrix-based kernel logistic regression for imbalanced data classification

M Ohsaki, P Wang, K Matsuda… - … on Knowledge and …, 2017 - ieeexplore.ieee.org
There have been many attempts to classify imbalanced data, since this classification is
critical in a wide variety of applications related to the detection of anomalies, failures, and …

Automatic speech recognition: History, methods and challenges

D O'Shaughnessy - Pattern Recognition, 2008 - Elsevier
The field of automatic speech recognition (ASR) is discussed from the viewpoint of pattern
recognition (PR). This tutorial examines the problem area, its methods, successes and …

Large vocabulary continuous speech recognition with context-dependent DBN-HMMs

GE Dahl, D Yu, L Deng, A Acero - 2011 IEEE international …, 2011 - ieeexplore.ieee.org
The context-independent deep belief network (DBN) hidden Markov model (HMM) hybrid
architecture has recently achieved promising results for phone recognition. In this work, we …

[PDF][PDF] Voice conversion in high-order eigen space using deep belief nets.

T Nakashika, R Takashima, T Takiguchi… - Interspeech, 2013 - me.cs.scitec.kobe-u.ac.jp
This paper presents a voice conversion technique using Deep Belief Nets (DBNs) to build
high-order eigen spaces of the source/target speakers, where it is easier to convert the …

Discriminative learning in sequential pattern recognition

X He, L Deng, W Chou - IEEE Signal Processing Magazine, 2008 - ieeexplore.ieee.org
In this article, we studied the objective functions of MMI, MCE, and MPE/MWE for
discriminative learning in sequential pattern recognition. We presented an approach that …

Voice conversion using RNN pre-trained by recurrent temporal restricted Boltzmann machines

T Nakashika, T Takiguchi, Y Ariki - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
This paper presents a voice conversion (VC) method that utilizes the recently proposed
probabilistic models called recurrent temporal restricted Boltzmann machines (RTRBMs) …