Automatic speech recognition for under-resourced languages: A survey

L Besacier, E Barnard, A Karpov, T Schultz - Speech communication, 2014 - Elsevier
Speech processing for under-resourced languages is an active field of research, which has
experienced significant progress during the past decade. We propose, in this paper, a …

Sub-lexical language models with word level pronunciation lexicons

H Sak, M Saraclar - US Patent 9,292,489, 2016 - Google Patents
An automatic speech recognition (ASR) system and method are provided for using sub-
lexical language models together with word level pronunciation lexicons. These approaches …

Importance of high-order n-gram models in morph-based speech recognition

T Hirsimaki, J Pylkkonen… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Speech recognition systems trained for morphologically rich languages face the problem of
vocabulary growth caused by prefixes, suffixes, inflections, and compound words. Solutions …

Automatic speech recognition for under-resourced languages: application to Vietnamese language

VB Le, L Besacier - IEEE Transactions on Audio, Speech, and …, 2009 - ieeexplore.ieee.org
This paper presents our work in automatic speech recognition (ASR) in the context of under-
resourced languages with application to Vietnamese. Different techniques for bootstrapping …

Development of a large spontaneous speech database of agglutinative Hungarian language

T Neuberger, D Gyarmathy, TE Gráczi… - Text, Speech and …, 2014 - Springer
In this paper, a large Hungarian spoken language database is introduced. This phonetically-
based multi-purpose database contains various types of spontaneous and read speech from …

Automatic transcription challenges for Inuktitut, a low-resource polysynthetic language

V Gupta, G Boulianne - … of the Twelfth Language Resources and …, 2020 - aclanthology.org
We introduce the first attempt at automatic speech recognition (ASR) in Inuktitut, as a
representative for polysynthetic, low-resource languages, like many of the 900 Indigenous …

Improved recognition of spontaneous Hungarian speech—Morphological and acoustic modeling techniques for a less resourced task

P Mihajlik, Z Tuske, B Tarján… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Various morphological and acoustic modeling techniques are evaluated on a less
resourced, spontaneous Hungarian large-vocabulary continuous speech recognition …

Morpholexical and discriminative language models for Turkish automatic speech recognition

H Sak, M Saraçlar, T Gungor - IEEE transactions on audio …, 2012 - ieeexplore.ieee.org
This paper introduces two complementary language modeling approaches for
morphologically rich languages aiming to alleviate out-of-vocabulary (OOV) word problem …

[PDF][PDF] Computational modeling of agglutinative languages: the challenge for southern bantu languages

F Kambarami, S McLachlan, B Bozic… - Arusha Work. Pap. Afr …, 2021 - academia.edu
In computational linguistics, language models are probabilistic models that predict the
likelihood of words occurring within specific sentences. They are key components of many …

SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian

H Cucu, A Buzo, L Besacier, C Burileanu - Speech Communication, 2014 - Elsevier
This study investigates the possibility of using statistical machine translation to create
domain-specific language resources. We propose a methodology that aims to create a …