[图书][B] Distant speech recognition

M Wölfel, J McDonough - 2009 - books.google.com
A complete overview of distant automatic speech recognition The performance of
conventional Automatic Speech Recognition (ASR) systems degrades dramatically as soon …

[PDF][PDF] Dynamic language model adaptation using variational Bayes inference.

YC Tam, T Schultz - INTERSPEECH, 2005 - Citeseer
We propose an unsupervised dynamic language model (LM) adaptation framework using
long-distance latent topic mixtures. The framework employs the Latent Dirichlet Allocation …

[PDF][PDF] Unsupervised language model adaptation using latent semantic marginals

YC Tam, T Schultz - Ninth International Conference on …, 2006 - isl.anthropomatik.kit.edu
Abstract We integrated the Latent Dirichlet Allocation (LDA) approach, a latent semantic
analysis model, into unsupervised language model adaptation framework. We adapted a …

Improved models for Mandarin speech-to-text transcription

L Lamel, JL Gauvain, VB Le, I Oparin… - 2011 IEEE International …, 2011 - ieeexplore.ieee.org
This paper describes recent advances at LIMSI in Mandarin Chinese speech-to-text
transcription. A number of novel approaches were introduced in the different system …

Speech translation enhanced automatic speech recognition

M Paulik, S Stuker, C Fugen, T Schultz… - IEEE Workshop on …, 2005 - ieeexplore.ieee.org
Nowadays official documents have to be made available in many languages, like for
example in the EU with its 20 official languages. Therefore, the need for effective tools to aid …

The CU-HTK Mandarin broadcast news transcription system

R Sinha, MJF Gales, DY Kim, XA Liu… - … on Acoustics Speech …, 2006 - ieeexplore.ieee.org
This paper discusses the development of the CU-HTK Mandarin broadcast news (BN)
transcription system. The Mandarin BN task includes a significant amount of English data …

[PDF][PDF] Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end.

S Stüker, C Fügen, S Burger, M Wölfel - INTERSPEECH, 2006 - academia.edu
Cross-system adaptation and system combination methods, such as ROVER and confusion
network combination, are known to lower the word error rate of speech recognition systems …

Correlated latent semantic model for unsupervised LM adaptation

YC Tam, T Schultz - … Speech and Signal Processing-ICASSP'07, 2007 - ieeexplore.ieee.org
We propose a latent Dirichlet-tree allocation (LDTA) model-a correlated latent semantic
model-for unsupervised language model adaptation. The LDTA model extends the latent …

[PDF][PDF] Advances in lecture recognition: the ISL RT-06s evaluation system.

C Fügen, M Wölfel, JW McDonough, S Ikbal, F Kraft… - …, 2006 - cs.cmu.edu
This paper describes the 2006 lecture recognition system developed at the Interactive
Systems Laboratories (ISL), for individual head-microphone (IHM), single distant …

A broadcast news corpus for evaluation and tuning of German LVCSR systems

F Weninger, B Schuller, F Eyben, M Wöllmer… - arXiv preprint arXiv …, 2014 - arxiv.org
Transcription of broadcast news is an interesting and challenging application for large-
vocabulary continuous speech recognition (LVCSR). We present in detail the structure of a …