Development of language models for continuous Uzbek speech recognition system

A Mukhamadiyev, M Mukhiddinov, I Khujayarov… - Sensors, 2023 - mdpi.com
Automatic speech recognition systems with a large vocabulary and other natural language
processing applications cannot operate without a language model. Most studies on pre …

Creating a morphological and syntactic tagged corpus for the Uzbek language

M Sharipov, J Mattiev, J Sobirov, R Baltayev - arXiv preprint arXiv …, 2022 - arxiv.org
Nowadays, creation of the tagged corpora is becoming one of the most important tasks of
Natural Language Processing (NLP). There are not enough tagged corpora to build …

A crowdsourced open-source Kazakh speech corpus and initial speech recognition baseline

Y Khassanov, S Mussakhojayeva… - arXiv preprint arXiv …, 2020 - arxiv.org
We present an open-source speech corpus for the Kazakh language. The Kazakh speech
corpus (KSC) contains around 332 hours of transcribed audio comprising over 153,000 …

[PDF][PDF] KSC2: An Industrial-Scale Open-Source Kazakh Speech Corpus.

S Mussakhojayeva, Y Khassanov, HA Varol - INTERSPEECH, 2022 - isca-archive.org
We present the first industrial-scale open-source Kazakh speech corpus for automatic
speech recognition research and development. Our corpus subsumes two previously …

KazakhTTS: An open-source Kazakh text-to-speech synthesis dataset

S Mussakhojayeva, A Janaliyeva… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper introduces a high-quality open-source speech synthesis dataset for Kazakh, a
low-resource language spoken by over 13 million people worldwide. The dataset consists of …

Classification of scientific documents in the Kazakh language using deep neural networks and a fusion of images and text

A Bogdanchikov, D Ayazbayev, I Varlamis - Big Data and Cognitive …, 2022 - mdpi.com
The rapid development of natural language processing and deep learning techniques has
boosted the performance of related algorithms in several linguistic and text mining tasks …

Semantic hyper-graph based representation of nouns in the Kazakh language

B Yergesh, A Mukanova, A Sharipbay… - Computacion y …, 2014 - scielo.org.mx
We explain how semantic hyper-graphs are used to describe ontological models of
morphological rules of agglutinative languages, with the Kazakh language as a case study …

Developing an Online Kazakh-English-Russian Thesaurus of‎ Industry-Specific Terminology

AT Bayekeyeva, SZ Tazhibayeva, AA Shaheen… - International Journal of …, 2022 - ijscl.com
Industry-specific translation is one of the rapidly developing and highly demanded sectors in
Kazakhstan. This paper discusses the theoretical and methodological issues of compiling a …

Metalanguage and knowledgebase for Kazakh morphology

G Yelibayeva, A Mukanova, A Sharipbay… - … Science and Its …, 2019 - Springer
Currently, the volume of various information resources in the Turkic languages is increasing.
Processing of such resources requires thesauri and corpora created using a single …

[PDF][PDF] Syntactic annotation of kazakh: Following the universal dependencies guidelines. a report

A Makazhanov, A Sultangazina… - PROCEEDINGS OF …, 2015 - researchgate.net
The present work is a report on the authors' first attempt to use the universal dependencies
(UD)(de Marneffe et al., 2014) standard for syntactic annotation of Kazakh. The report is a …