Maximum likelihood linear transformations for HMM-based speech recognition

MJF Gales - Computer speech & language, 1998 - Elsevier
This paper examines the application of linear transformations for speaker and environmental
adaptation in an HMM-based speech recognition system. In particular, transformations that …

[图书][B] Digital speech processing: synthesis, and recognition

S Furui - 2018 - taylorfrancis.com
A study of digital speech processing, synthesis and recognition. This second edition
contains new sections on the international standardization of robust and flexible speech …

[图书][B] Distant speech recognition

M Wölfel, J McDonough - 2009 - books.google.com
A complete overview of distant automatic speech recognition The performance of
conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon …

Cluster adaptive training of hidden Markov models

MJF Gales - IEEE transactions on speech and audio …, 2000 - ieeexplore.ieee.org
When performing speaker adaptation, there are two conflicting requirements. First, the
speaker transform must be powerful enough to represent the speaker. Second, the transform …

Apparatus and method for speech utterance verification

B Ma, H Li, M Dong - 2008 - Google Patents
Speech recognition is a problem of pattern matching. Recorded speech patterns are treated
as sequences of electrical signals. A recognition process involves classifying segments of …

Channel robust speaker verification via feature mapping

DA Reynolds - … Conference on Acoustics, Speech, and Signal …, 2003 - ieeexplore.ieee.org
In speaker recognition applications, channel variability is a major cause of errors.
Techniques in the feature, model and score domains have been applied to mitigate channel …

Method and apparatus for fast on-line automatic speaker/environment adaptation for speech/speaker recognition in the presence of changing environments

S Kadambe, R Burns, M Iseli - US Patent 7,457,745, 2008 - Google Patents
(57) ABSTRACT A fast on-line automatic speaker/environment adaptation suitable for
speech/speaker recognition system, method and computer program product are presented …

Vocal tract normalization equals linear transformation in cepstral space

M Pitz, H Ney - IEEE Transactions on Speech and Audio …, 2005 - ieeexplore.ieee.org
Vocal tract normalization (VTN) is a widely used speaker normalization technique which
reduces the effect of different lengths of the human vocal tract and results in an improved …

[图书][B] Biometric authentication: a machine learning approach

SY Kung, MW Mak, SH Lin, MW Mak, S Lin - 2005 - eie.polyu.edu.hk
Gaussian Mixture Models (GMMs) and Radial Basis Function (RBF) networks are two of the
promising neural models for pattern classification. In this laboratory exercise, your task is to …

MVA processing of speech features

CP Chen, JA Bilmes - IEEE Transactions on Audio, Speech …, 2006 - ieeexplore.ieee.org
In this paper, we investigate a technique consisting of mean subtraction, variance
normalization and time sequence filtering. Unlike other techniques, it applies auto …