Development of language models for continuous Uzbek speech recognition system
Automatic speech recognition systems with a large vocabulary and other natural language
processing applications cannot operate without a language model. Most studies on pre …
processing applications cannot operate without a language model. Most studies on pre …
Creating a morphological and syntactic tagged corpus for the Uzbek language
M Sharipov, J Mattiev, J Sobirov, R Baltayev - arXiv preprint arXiv …, 2022 - arxiv.org
Nowadays, creation of the tagged corpora is becoming one of the most important tasks of
Natural Language Processing (NLP). There are not enough tagged corpora to build …
Natural Language Processing (NLP). There are not enough tagged corpora to build …
A crowdsourced open-source Kazakh speech corpus and initial speech recognition baseline
Y Khassanov, S Mussakhojayeva… - arXiv preprint arXiv …, 2020 - arxiv.org
We present an open-source speech corpus for the Kazakh language. The Kazakh speech
corpus (KSC) contains around 332 hours of transcribed audio comprising over 153,000 …
corpus (KSC) contains around 332 hours of transcribed audio comprising over 153,000 …
[PDF][PDF] KSC2: An Industrial-Scale Open-Source Kazakh Speech Corpus.
We present the first industrial-scale open-source Kazakh speech corpus for automatic
speech recognition research and development. Our corpus subsumes two previously …
speech recognition research and development. Our corpus subsumes two previously …
KazakhTTS: An open-source Kazakh text-to-speech synthesis dataset
S Mussakhojayeva, A Janaliyeva… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper introduces a high-quality open-source speech synthesis dataset for Kazakh, a
low-resource language spoken by over 13 million people worldwide. The dataset consists of …
low-resource language spoken by over 13 million people worldwide. The dataset consists of …
Classification of scientific documents in the Kazakh language using deep neural networks and a fusion of images and text
A Bogdanchikov, D Ayazbayev, I Varlamis - Big Data and Cognitive …, 2022 - mdpi.com
The rapid development of natural language processing and deep learning techniques has
boosted the performance of related algorithms in several linguistic and text mining tasks …
boosted the performance of related algorithms in several linguistic and text mining tasks …
Semantic hyper-graph based representation of nouns in the Kazakh language
We explain how semantic hyper-graphs are used to describe ontological models of
morphological rules of agglutinative languages, with the Kazakh language as a case study …
morphological rules of agglutinative languages, with the Kazakh language as a case study …
Developing an Online Kazakh-English-Russian Thesaurus of Industry-Specific Terminology
AT Bayekeyeva, SZ Tazhibayeva, AA Shaheen… - International Journal of …, 2022 - ijscl.com
Industry-specific translation is one of the rapidly developing and highly demanded sectors in
Kazakhstan. This paper discusses the theoretical and methodological issues of compiling a …
Kazakhstan. This paper discusses the theoretical and methodological issues of compiling a …
Metalanguage and knowledgebase for Kazakh morphology
Currently, the volume of various information resources in the Turkic languages is increasing.
Processing of such resources requires thesauri and corpora created using a single …
Processing of such resources requires thesauri and corpora created using a single …
[PDF][PDF] Syntactic annotation of kazakh: Following the universal dependencies guidelines. a report
A Makazhanov, A Sultangazina… - PROCEEDINGS OF …, 2015 - researchgate.net
The present work is a report on the authors' first attempt to use the universal dependencies
(UD)(de Marneffe et al., 2014) standard for syntactic annotation of Kazakh. The report is a …
(UD)(de Marneffe et al., 2014) standard for syntactic annotation of Kazakh. The report is a …