Single-channel multitalker speech recognition

SJ Rennie, JR Hershey… - IEEE Signal Processing …, 2010 - ieeexplore.ieee.org
We have described some of the problems with modeling mixed acoustic signals in the log
spectral domain using graphical models, as well as some current approaches to handling …

Hidden Markov acoustic modeling with bootstrap and restructuring for low-resourced languages

X Cui, J Xue, X Chen, PA Olsen… - … on Audio, Speech …, 2012 - ieeexplore.ieee.org
This paper proposes an acoustic modeling approach based on bootstrap and restructuring
to dealing with data sparsity for low-resourced languages. The goal of the approach is to …

Hierarchical variational loopy belief propagation for multi-talker speech recognition

SJ Rennie, JR Hershey… - 2009 IEEE Workshop on …, 2009 - ieeexplore.ieee.org
We present a new method for multi-talker speech recognition using a single-channel that
combines loopy belief propagation and variational inference methods to control the …

Refactoring acoustic models using variational expectation-maximization

PL Dognin, JR Hershey, V Goel, PA Olsen - INTERSPEECH, 2009 - isca-archive.org
In probabilistic modeling, it is often useful to change the structure, or refactor, a model, so
that it has a different number of components, different parameter sharing, or other …

Model restructuring for client and server based automatic speech recognition

P Dognin, V Goel, JR Hershey, PA Olsen - US Patent 8,635,067, 2014 - Google Patents
Access is obtained to a large reference acoustic model for automatic speech recognition.
The large reference acoustic model has L states modeled by L mixture models, and the …

Restructuring exponential family mixture models

PL Dognin, JR Hershey, V Goel… - … Annual Conference of the …, 2010 - researchgate.net
Variational KL (varKL) divergence minimization was previously applied to restructuring
acoustic models (AMs) using Gaussian mixture models by reducing their size while …

A New Distance Measure for a Variable‐Sized Acoustic Model Based on MDL Technique

HY Cho, S Kim - ETRI journal, 2010 - Wiley Online Library
Embedding a large vocabulary speech recognition system in mobile devices requires a
reduced acoustic model obtained by eliminating redundant model parameters. In …

Rapid Nonlinear Speaker Adaptation for Large-Vocabulary Continuous Speech Recognition

Z Roupakia, A Ragni, MJF Gales - INTERSPEECH, 2012 - isca-archive.org
Recently, kernel eigenvoices were revisited using kernel representations of distributions for
rapid nonlinear speaker adaptation. These representations reassure the validity of the …

Restructuring acoustic models for client and server-based automatic speech recognition

PL Dognin, JR Hershey, V Goel, PA Olsen - SQ2010, Mar, 2010 - researchgate.net
ABSTRACT A problem often encountered in probabilistic modeling is restructuring a model
to change its number of components, parameter sharing, or some other structural …

A Study on Improved MDL Technique for Optimization of Acoustic Model

HY Cho, SH Kim - The Journal of the Acoustical Society of Korea, 2010 - koreascience.kr
This paper describes optimization methods of acoustic models in HMM-based continuous
speech recognition. Most of the conventional speech recognition systems use the same …