Automatic speech recognition and speech variability: A review

M Benzeghiba, R De Mori, O Deroo, S Dupont… - Speech …, 2007 - Elsevier
Major progress is being recorded regularly on both the technology and exploitation of
automatic speech recognition (ASR) and spoken language systems. However, there are still …

Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains

JL Gauvain, CH Lee - IEEE transactions on speech and audio …, 1994 - ieeexplore.ieee.org
In this paper, a framework for maximum a posteriori (MAP) estimation of hidden Markov
models (HMM) is presented. Three key issues of MAP estimation, namely, the choice of prior …

[图书][B] Distant speech recognition

M Wölfel, J McDonough - 2009 - books.google.com
A complete overview of distant automatic speech recognition The performance of
conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon …

Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction

V Vestman, D Gowda, M Sahidullah, P Alku… - Speech …, 2018 - Elsevier
From the available biometric technologies, automatic speaker recognition is one of the most
convenient and accessible ones due to abundance of mobile devices equipped with a …

Speaker adaptation using constrained estimation of Gaussian mixtures

VV Digalakis, D Rtischev… - IEEE Transactions on …, 1995 - ieeexplore.ieee.org
A trend in automatic speech recognition systems is the use of continuous mixture-density
hidden Markov models (HMMs). Despite the good recognition performance that these …

A maximum-likelihood approach to stochastic matching for robust speech recognition

A Sankar, CH Lee - IEEE transactions on speech and Audio …, 1996 - ieeexplore.ieee.org
Presents a maximum-likelihood (ML) stochastic matching approach to decrease the acoustic
mismatch between a test utterance and a given set of speech models so as to reduce the …

Choice of basis for Laplace approximation

DJC MacKay - Machine learning, 1998 - Springer
Maximum a posteriori optimization of parameters and the Laplace approximation for the
marginal likelihood are both basis-dependent methods. This note compares two choices of …

[HTML][HTML] Electrooculography-based continuous eye-writing recognition system for efficient assistive communication systems

F Fang, T Shinozaki - PloS one, 2018 - journals.plos.org
Human-computer interface systems whose input is based on eye movements can serve as a
means of communication for patients with locked-in syndrome. Eye-writing is one such …

On adaptive decision rules and decision parameter adaptation for automatic speech recognition

CH Lee, Q Huo - Proceedings of the IEEE, 2000 - ieeexplore.ieee.org
Recent advances in automatic speech recognition are accomplished by designing a plug-in
maximum a posteriori decision rule such that the forms of the acoustic and language model …

Deep-neural network approaches for speech recognition with heterogeneous groups of speakers including children

R Serizel, D Giuliani - Natural Language Engineering, 2017 - cambridge.org
This paper introduces deep neural network (DNN)–hidden Markov model (HMM)-based
methods to tackle speech recognition in heterogeneous groups of speakers including …