Automatic recognition of spontaneous speech for access to multilingual oral history archives

W Byrne, D Doermann, M Franz… - … on Speech and …, 2004 - ieeexplore.ieee.org
Much is known about the design of automated systems to search broadcast news, but it has
only recently become possible to apply similar techniques to large collections of …

Improved recognition of spontaneous Hungarian speech—Morphological and acoustic modeling techniques for a less resourced task

P Mihajlik, Z Tüske, B Tarján… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Various morphological and acoustic modeling techniques are evaluated on a less
resourced, spontaneous Hungarian large-vocabulary continuous speech recognition …

System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive

J Psutka, J Švec, JV Psutka, J Vaněk, A Pražák… - EURASIP Journal on …, 2011 - Springer
The main objective of the work presented in this paper was to develop a complete system
that would accomplish the original visions of the MALACH project. Those goals were to …

Automatic transcription of Czech, Russian, and Slovak spontaneous speech in the MALACH project.

J Psutka, P Ircing, JV Psutka, J Hajic, WJ Byrne… - …, 2005 - ufal.mff.cuni.cz
This paper describes the 3.5-year effort put into building LVCSR systems for
recognition of spontaneous speech of Czech, Russian, and Slovak witnesses of the …

A morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages-like Hungarian.

P Mihajlik, T Fegyó, Z Tüske, P Ircing - INTERSPEECH, 2007 - academia.edu
A coupled acoustic- and language-modeling approach is presented for the recognition of
spontaneous speech primarily in agglutinative languages. The effectiveness of the approach …

Application of lemmatization and summarization methods in topic identification module for large scale language modeling data filtering

L Skorkovská - Text, Speech and Dialogue: 15th International …, 2012 - Springer
The paper presents experiments with the topic identification module which is a part of a
complex system for acquisition and storing large volumes of text data. The topic identification …

General framework for mining, processing and storing large amounts of electronic texts for language modeling purposes

J Švec, J Lehečka, P Ircing, L Skorkovská… - Language resources …, 2014 - Springer
The paper describes a general framework for mining large amounts of text data from a
defined set of Web pages. The acquired data are meant to constitute a corpus for training …

Recognition of heavily accented and emotional speech of English and Czech Holocaust survivors using various DNN architectures

JV Psutka, A Pražák, J Vaněk - International Conference on Speech and …, 2021 - Springer
The Malach Project [6] verified the possibility of using automatic speech recognition
(ASR) methods to search for information in large multilingual archives of Holocaust …

Corrective models for speech recognition of inflected languages

I Shafran, K Hall - Proceedings of the 2006 Conference on …, 2006 - aclanthology.org
This paper presents a corrective model for speech recognition of inflected languages. The
model, based on a discriminative framework, incorporates word ngrams features as well as …

Automatic topic identification for large scale language modeling data filtering

L Skorkovská, P Ircing, A Pražák, J Lehečka - Text, Speech and Dialogue …, 2011 - Springer
The paper presents a module for topic identification that is embedded into a complex system
for acquisition and storing large volumes of text data from the Web. The module processes …