Automatic speech recognition using limited vocabulary: A survey
JLKE Fendji, DCM Tala, BO Yenke… - Applied Artificial …, 2022 - Taylor & Francis
ABSTRACT Automatic Speech Recognition (ASR) is an active field of research due to its
large number of applications and the proliferation of interfaces or computing devices that …
large number of applications and the proliferation of interfaces or computing devices that …
Unsupervised learning of morphology
H Hammarström, L Borin - Computational Linguistics, 2011 - direct.mit.edu
This article surveys work on Unsupervised Learning of Morphology. We define
Unsupervised Learning of Morphology as the problem of inducing a description (of some …
Unsupervised Learning of Morphology as the problem of inducing a description (of some …
Automatic speech recognition for under-resourced languages: A survey
Speech processing for under-resourced languages is an active field of research, which has
experienced significant progress during the past decade. We propose, in this paper, a …
experienced significant progress during the past decade. We propose, in this paper, a …
Morph-based speech recognition and modeling of out-of-vocabulary words across languages
We explore the use of morph-based language models in large-vocabulary continuous-
speech recognition systems across four so-called morphologically rich languages: Finnish …
speech recognition systems across four so-called morphologically rich languages: Finnish …
[PDF][PDF] Word segmentation of informal Arabic with domain adaptation
W Monroe, S Green, CD Manning - … of the 52nd Annual Meeting of …, 2014 - aclanthology.org
Segmentation of clitics has been shown to improve accuracy on a variety of Arabic NLP
tasks. However, state-of-the-art Arabic word segmenters are either limited to formal Modern …
tasks. However, state-of-the-art Arabic word segmenters are either limited to formal Modern …
Turkish broadcast news transcription and retrieval
This paper summarizes our recent efforts for building a Turkish Broadcast News transcription
and retrieval system. The agglutinative nature of Turkish leads to a high number of out-of …
and retrieval system. The agglutinative nature of Turkish leads to a high number of out-of …
Importance of high-order n-gram models in morph-based speech recognition
T Hirsimaki, J Pylkkonen… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Speech recognition systems trained for morphologically rich languages face the problem of
vocabulary growth caused by prefixes, suffixes, inflections, and compound words. Solutions …
vocabulary growth caused by prefixes, suffixes, inflections, and compound words. Solutions …
Improved subword modeling for WFST-based speech recognition
Because in agglutinative languages the number of observed word forms is very high,
subword units are often utilized in speech recognition. However, the proper use of subword …
subword units are often utilized in speech recognition. However, the proper use of subword …
Alternative structures for character-level RNNs
Recurrent neural networks are convenient and efficient models for language modeling.
However, when applied on the level of characters instead of words, they suffer from several …
However, when applied on the level of characters instead of words, they suffer from several …
Large vocabulary Russian speech recognition using syntactico-statistical language modeling
Speech is the most natural way of human communication and in order to achieve convenient
and efficient human–computer interaction implementation of state-of-the-art spoken …
and efficient human–computer interaction implementation of state-of-the-art spoken …