Automatic speech recognition using limited vocabulary: A survey

JLKE Fendji, DCM Tala, BO Yenke… - Applied Artificial …, 2022 - Taylor & Francis
ABSTRACT Automatic Speech Recognition (ASR) is an active field of research due to its
large number of applications and the proliferation of interfaces or computing devices that …

Unsupervised learning of morphology

H Hammarström, L Borin - Computational Linguistics, 2011 - direct.mit.edu
This article surveys work on Unsupervised Learning of Morphology. We define
Unsupervised Learning of Morphology as the problem of inducing a description (of some …

Automatic speech recognition for under-resourced languages: A survey

L Besacier, E Barnard, A Karpov, T Schultz - Speech communication, 2014 - Elsevier
Speech processing for under-resourced languages is an active field of research, which has
experienced significant progress during the past decade. We propose, in this paper, a …

Morph-based speech recognition and modeling of out-of-vocabulary words across languages

M Creutz, T Hirsimäki, M Kurimo, A Puurula… - ACM Transactions on …, 2007 - dl.acm.org
We explore the use of morph-based language models in large-vocabulary continuous-
speech recognition systems across four so-called morphologically rich languages: Finnish …

[PDF][PDF] Word segmentation of informal Arabic with domain adaptation

W Monroe, S Green, CD Manning - … of the 52nd Annual Meeting of …, 2014 - aclanthology.org
Segmentation of clitics has been shown to improve accuracy on a variety of Arabic NLP
tasks. However, state-of-the-art Arabic word segmenters are either limited to formal Modern …

Turkish broadcast news transcription and retrieval

E Arisoy, D Can, S Parlak, H Sak… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
This paper summarizes our recent efforts for building a Turkish Broadcast News transcription
and retrieval system. The agglutinative nature of Turkish leads to a high number of out-of …

Importance of high-order n-gram models in morph-based speech recognition

T Hirsimaki, J Pylkkonen… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Speech recognition systems trained for morphologically rich languages face the problem of
vocabulary growth caused by prefixes, suffixes, inflections, and compound words. Solutions …

Improved subword modeling for WFST-based speech recognition

P Smit, S Virpioja, M Kurimo - Interspeech, 2017 - research.aalto.fi
Because in agglutinative languages the number of observed word forms is very high,
subword units are often utilized in speech recognition. However, the proper use of subword …

Alternative structures for character-level RNNs

P Bojanowski, A Joulin, T Mikolov - arXiv preprint arXiv:1511.06303, 2015 - arxiv.org
Recurrent neural networks are convenient and efficient models for language modeling.
However, when applied on the level of characters instead of words, they suffer from several …

Large vocabulary Russian speech recognition using syntactico-statistical language modeling

A Karpov, K Markov, I Kipyatkova, D Vazhenina… - Speech …, 2014 - Elsevier
Speech is the most natural way of human communication and in order to achieve convenient
and efficient human–computer interaction implementation of state-of-the-art spoken …