Automatic speech recognition for under-resourced languages: A survey
Speech processing for under-resourced languages is an active field of research, which has
experienced significant progress during the past decade. We propose, in this paper, a …
Sub-lexical language models with word level pronunciation lexicons
H Sak, M Saraclar - US Patent 9,292,489, 2016 - Google Patents
An automatic speech recognition (ASR) system and method are provided for using sub-
lexical language models together with word level pronunciation lexicons. These approaches …
Importance of high-order n-gram models in morph-based speech recognition
T Hirsimaki, J Pylkkonen… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Speech recognition systems trained for morphologically rich languages face the problem of
vocabulary growth caused by prefixes, suffixes, inflections, and compound words. Solutions …
Automatic speech recognition for under-resourced languages: application to Vietnamese language
VB Le, L Besacier - IEEE Transactions on Audio, Speech, and …, 2009 - ieeexplore.ieee.org
This paper presents our work in automatic speech recognition (ASR) in the context of under-
resourced languages with application to Vietnamese. Different techniques for bootstrapping …
Development of a large spontaneous speech database of agglutinative Hungarian language
T Neuberger, D Gyarmathy, TE Gráczi… - Text, Speech and …, 2014 - Springer
In this paper, a large Hungarian spoken language database is introduced. This phonetically-
based multi-purpose database contains various types of spontaneous and read speech from …
Automatic transcription challenges for Inuktitut, a low-resource polysynthetic language
V Gupta, G Boulianne - … of the Twelfth Language Resources and …, 2020 - aclanthology.org
We introduce the first attempt at automatic speech recognition (ASR) in Inuktitut, as a
representative for polysynthetic, low-resource languages, like many of the 900 Indigenous …
Improved recognition of spontaneous Hungarian speech—Morphological and acoustic modeling techniques for a less resourced task
Various morphological and acoustic modeling techniques are evaluated on a less
resourced, spontaneous Hungarian large-vocabulary continuous speech recognition …
Morpholexical and discriminative language models for Turkish automatic speech recognition
This paper introduces two complementary language modeling approaches for
morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem …
Computational modeling of agglutinative languages: the challenge for Southern Bantu languages
In computational linguistics, language models are probabilistic models that predict the
likelihood of words occurring within specific sentences. They are key components of many …
SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian
This study investigates the possibility of using statistical machine translation to create
domain-specific language resources. We propose a methodology that aims to create a …