Maximum likelihood linear transformations for HMM-based speech recognition
MJF Gales - Computer speech & language, 1998 - Elsevier
This paper examines the application of linear transformations for speaker and environmental
adaptation in an HMM-based speech recognition system. In particular, transformations that …
adaptation in an HMM-based speech recognition system. In particular, transformations that …
[图书][B] Digital speech processing: synthesis, and recognition
S Furui - 2018 - taylorfrancis.com
A study of digital speech processing, synthesis and recognition. This second edition
contains new sections on the international standardization of robust and flexible speech …
contains new sections on the international standardization of robust and flexible speech …
[图书][B] Distant speech recognition
M Wölfel, J McDonough - 2009 - books.google.com
A complete overview of distant automatic speech recognition The performance of
conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon …
conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon …
Cluster adaptive training of hidden Markov models
MJF Gales - IEEE transactions on speech and audio …, 2000 - ieeexplore.ieee.org
When performing speaker adaptation, there are two conflicting requirements. First, the
speaker transform must be powerful enough to represent the speaker. Second, the transform …
speaker transform must be powerful enough to represent the speaker. Second, the transform …
Apparatus and method for speech utterance verification
Speech recognition is a problem of pattern matching. Recorded speech patterns are treated
as sequences of electrical signals. A recognition process involves classifying segments of …
as sequences of electrical signals. A recognition process involves classifying segments of …
Channel robust speaker verification via feature mapping
DA Reynolds - … Conference on Acoustics, Speech, and Signal …, 2003 - ieeexplore.ieee.org
In speaker recognition applications, channel variability is a major cause of errors.
Techniques in the feature, model and score domains have been applied to mitigate channel …
Techniques in the feature, model and score domains have been applied to mitigate channel …
Method and apparatus for fast on-line automatic speaker/environment adaptation for speech/speaker recognition in the presence of changing environments
S Kadambe, R Burns, M Iseli - US Patent 7,457,745, 2008 - Google Patents
(57) ABSTRACT A fast on-line automatic speaker/environment adaptation suitable for
speech/speaker recognition system, method and computer program product are presented …
speech/speaker recognition system, method and computer program product are presented …
Vocal tract normalization equals linear transformation in cepstral space
M Pitz, H Ney - IEEE Transactions on Speech and Audio …, 2005 - ieeexplore.ieee.org
Vocal tract normalization (VTN) is a widely used speaker normalization technique which
reduces the effect of different lengths of the human vocal tract and results in an improved …
reduces the effect of different lengths of the human vocal tract and results in an improved …
[图书][B] Biometric authentication: a machine learning approach
Gaussian Mixture Models (GMMs) and Radial Basis Function (RBF) networks are two of the
promising neural models for pattern classification. In this laboratory exercise, your task is to …
promising neural models for pattern classification. In this laboratory exercise, your task is to …
MVA processing of speech features
CP Chen, JA Bilmes - IEEE Transactions on Audio, Speech …, 2006 - ieeexplore.ieee.org
In this paper, we investigate a technique consisting of mean subtraction, variance
normalization and time sequence filtering. Unlike other techniques, it applies auto …
normalization and time sequence filtering. Unlike other techniques, it applies auto …