Spoken language recognition: from fundamentals to practice

H Li, B Ma, KA Lee - Proceedings of the IEEE, 2013 - ieeexplore.ieee.org
Spoken language recognition refers to the automatic process through which we determine
or verify the identity of the language spoken in a speech sample. We study a computational …

Automatic speech recognition and speech variability: A review

M Benzeghiba, R De Mori, O Deroo, S Dupont… - Speech …, 2007 - Elsevier
Major progress is being recorded regularly on both the technology and exploitation of
automatic speech recognition (ASR) and spoken language systems. However, there are still …

Deep convolutional neural networks for large-scale speech tasks

TN Sainath, B Kingsbury, G Saon, H Soltau… - Neural networks, 2015 - Elsevier
Abstract Convolutional Neural Networks (CNNs) are an alternative type of neural network
that can be used to reduce spectral variations and model spectral correlations which exist in …

Maximum likelihood linear transformations for HMM-based speech recognition

MJF Gales - Computer speech & language, 1998 - Elsevier
This paper examines the application of linear transformations for speaker and environmental
adaptation in an HMM-based speech recognition system. In particular, transformations that …

The application of hidden Markov models in speech recognition

M Gales, S Young - Foundations and Trends® in Signal …, 2008 - nowpublishers.com
The Application of Hidden Markov Models in Speech Recognition Page 1 The Application of
Hidden Markov Models in Speech Recognition Full text available at: http://dx.doi.org/10.1561/2000000004 …

Adaptation algorithms for neural network-based speech recognition: An overview

P Bell, J Fainberg, O Klejch, J Li… - IEEE Open Journal …, 2020 - ieeexplore.ieee.org
We present a structured overview of adaptation algorithms for neural network-based speech
recognition, considering both hybrid hidden Markov model/neural network systems and end …

[图书][B] Distant speech recognition

M Wölfel, J McDonough - 2009 - books.google.com
A complete overview of distant automatic speech recognition The performance of
conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon …

[PDF][PDF] Discriminative training for large vocabulary speech recognition

D Povey - 2005 - researchgate.net
This thesis investigates the use of discriminative criteria for training HMM parameters for
speech recognition, in particular the Maximum Mutual Information (MMI) criterion and a new …

Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code

O Abdel-Hamid, H Jiang - 2013 IEEE International Conference …, 2013 - ieeexplore.ieee.org
In this paper, we propose a new fast speaker adaptation method for the hybrid NN-HMM
speech recognition model. The adaptation method depends on a joint learning of a large …

Computing mel-frequency cepstral coefficients on the power spectrum

S Molau, M Pitz, R Schluter… - 2001 IEEE international …, 2001 - ieeexplore.ieee.org
We present a method to derive Mel-frequency cepstral coefficients directly from the power
spectrum of a speech signal. We show that omitting the filterbank in signal analysis does not …