Developing named entity recognition algorithms for Uzbek: Dataset Insights and Implementation

D Mengliev, V Barakhnin, N Abdurakhmonova… - Data in Brief, 2024 - Elsevier
This paper presents a dataset and approaches to named entity recognition (NER) in the Uzbek
language, in a resource-constrained language environment. Despite the increase in NLP …

Uzbek text's correspondence with the educational potential of pupils: a case study of the School corpus

K Madatov, S Matlatipov, M Aripov - arXiv preprint arXiv:2303.00465, 2023 - arxiv.org
One of the major challenges of an educational system is choosing appropriate content
considering pupils' age and intellectual potential. In this article the experiment of primary …

Uzbek text summarization based on TF-IDF

K Madatov, S Bekchanov, J Vičič - arXiv preprint arXiv:2303.00461, 2023 - arxiv.org
The volume of information is increasing at an incredible rate with the rapid development of
the Internet and electronic information services. Due to time constraints, we don't have the …

Automatic detection of stop words for texts in the Uzbek language

K Madatov, S Bekchanov, J Vičič - 2022 - preprints.org
Stop words are very important for information retrieval and text analysis investigation. This
study aimed to automatically analyze and detect stop words in texts in the Uzbek language …

Dataset of Karakalpak language stop words

K Madatov, S Bekchanov, J Vičič - Data in Brief, 2023 - Elsevier
The dataset presented in this paper aims to address the challenge of automatic extraction of
stop words in Natural Language Processing (NLP) for the low-resource Karakalpak …

Building a Comprehensive Uzbek Lexicon: Bridging Dialects for Text Standardization

DB Mengliev, NZ Abdurakhmonova… - 2024 IEEE 25th …, 2024 - ieeexplore.ieee.org
As part of the study, the authors developed a dictionary of the formal Uzbek language and its
dialects, which can be used in the tasks of standardizing mixed texts in various dialects of …

Hybrid Naive Bayes TF-IDF Algorithm and Lexicon Approach for Sentiment Analysis of Reviews

AH Ramadhani, HQ Djauhari, V Lius… - International Journal of …, 2024 - cyberleninka.ru
Amidst the increasing reliance on social media for public expression, accurate sentiment
analysis has become essential, notably in assessing application reviews. This study focuses …

Enhancing Sentiment Analysis in Uzbek Language Texts through Weighted Lexical Features

D Mengliev, N Abdurakhmonova… - 2024 IEEE 25th …, 2024 - ieeexplore.ieee.org
This article presents an original sentiment analysis algorithm developed for analyzing Uzbek
language texts in order to correctly determine and identify the sentiment of the text. The …

The Algorithm of Uzbek Text Summarizer

KA Madatov, SK Bekchanov - 2024 IEEE 25th International …, 2024 - ieeexplore.ieee.org
The main goal of scientific researchers is to improve their knowledge by finding information
about their field from their daily news. In this case, the researcher faces the issue of …

Automating the Extraction of Words and Topics in Indonesian Using the Term Frequency-Inverse Document Frequency Algorithm and Latent Dirichlet Allocation

L Mutawalli, MTA Zaen, MF Zulkarnaen - JISA (Jurnal Informatika dan …, 2024 - trilogi.ac.id
Keyword extraction and topic modeling in the analysis of Gojek user reviews in Indonesian
are very important. By understanding user preferences and needs through keyword …