[PDF][PDF] The RWTH Aachen University open source speech recognition system
D Rybach, C Gollan, G Heigold… - … Annual Conference of …, 2009 - isca-archive.org
We announce the public availability of the RWTH Aachen University speech recognition
toolkit. The toolkit includes state of the art speech recognition technology for acoustic model …
toolkit. The toolkit includes state of the art speech recognition technology for acoustic model …
[PDF][PDF] Rasr-the rwth aachen university open source speech recognition toolkit
D Rybach, S Hahn, P Lehnen… - Proc. ieee …, 2011 - www-i6.informatik.rwth-aachen.de
RASR is the open source version of the well-proven speech recognition toolkit developed
and used at RWTH Aachen University. The current version of the package includes state of …
and used at RWTH Aachen University. The current version of the package includes state of …
Multiple proposals for continuous arabic sign language recognition
The deaf community relies on sign language as the primary means of communication. For
the millions of people around the world who suffer from hearing loss, interaction with hearing …
the millions of people around the world who suffer from hearing loss, interaction with hearing …
[PDF][PDF] Cross-language bootstrapping for unsupervised acoustic model training: rapid development of a Polish speech recognition system.
This paper describes the rapid development of a Polish language speech recognition
system. The system development was performed without access to any transcribed acoustic …
system. The system development was performed without access to any transcribed acoustic …
Equivalence of generative and log-linear models
Conventional speech recognition systems are based on hidden Markov models (HMMs) with
Gaussian mixture models (GHMMs). Discriminative log-linear models are an alternative …
Gaussian mixture models (GHMMs). Discriminative log-linear models are an alternative …
[PDF][PDF] Automatic Live Subtitling: state of the art, expectations and current trends
C Aliprandi, C Scudellari, I Gallucci… - Proceedings of NAB …, 2014 - vicomtech.org
The subtitling demand has grown quickly over the years. The path of manual subtitling is no
longer feasible, due to increased costs and reduced production times. Assisted Subtitling is …
longer feasible, due to increased costs and reduced production times. Assisted Subtitling is …
Automating live and batch subtitling of multimedia contents for several European languages
The subtitling demand of multimedia content has grown quickly over the last years,
especially after the adoption of the new European audiovisual legislation, which forces to …
especially after the adoption of the new European audiovisual legislation, which forces to …
White-space models for offline Arabic handwriting recognition
We propose to explicitly model white-spaces for Arabic handwriting recognition within
different writing variants. Position-dependent character shapes in Arabic handwriting allow …
different writing variants. Position-dependent character shapes in Arabic handwriting allow …
Modified MMI/MPE: A direct evaluation of the margin in speech recognition
In this paper we show how common speech recognition training criteria such as the
Minimum Phone Error criterion or the Maximum Mutual Information criterion can be …
Minimum Phone Error criterion or the Maximum Mutual Information criterion can be …
[PDF][PDF] A log-linear discriminative modeling framework for speech recognition
G Heigold - 2010 - www-i6.informatik.rwth-aachen.de
Conventional speech recognition systems are based on Gaussian hidden Markov models
(HMMs). Discriminative techniques such as log-linear modeling have been investigated in …
(HMMs). Discriminative techniques such as log-linear modeling have been investigated in …