Kurdish Fake News Detection Based on Machine Learning Approaches
The widespread use of social media platforms and the internet has increased information
sharing, including both true and false news. Detecting fake news is challenging, and several …
Kurdish news dataset headlines (KNDH) through multiclass classification
The rapid growth of technology has massively increased the amount of text data. The data
can be mined and utilized for numerous natural language processing (NLP) tasks …
A language model for spell checking of educational texts in Kurdish (Sorani)
R Abdulrahman, H Hassani - … of the 1st Annual Meeting of the …, 2022 - aclanthology.org
Spell checkers are an integrated feature of most software applications handling text inputs.
When we write an email or compile a report on a desktop or a smartphone editor, a spell …
KurdSum: A new benchmark dataset for the Kurdish text summarization
S Badawi - Natural Language Processing Journal, 2023 - Elsevier
Summarizing a text is the process of condensing its content while still maintaining its
essential information. With the abundance of digital information available, summarization …
Approaches to corpus creation for low-resource language technology: the case of Southern Kurdish and Laki
One of the major challenges that under-represented and endangered language
communities face in language technology is the lack or paucity of language data. This is …
CODET: A benchmark for contrastive dialectal evaluation of machine translation
Neural machine translation (NMT) systems exhibit limited robustness in handling source-
side linguistic variations. Their performance tends to degrade when faced with even slight …
Transfer learning for low-resource sentiment analysis
Sentiment analysis is the process of identifying and extracting subjective information from
text. Despite the advances to employ cross-lingual approaches in an automatic way, the …
Using multilingual bidirectional encoder representations from transformers on medical corpus for Kurdish text classification
SS Badawi - Aro-The Scientific Journal of Koya University, 2023 - aro.koyauniversity.org
Technology has dominated a huge part of human life. Furthermore, technology users use
language continuously to express feelings and sentiments about things. The science behind …
Jira: a Central Kurdish speech recognition system, designing and building speech corpus and pronunciation lexicon
This paper introduces the first large vocabulary speech recognition system (LVSR) for the
Central Kurdish language, named Jira. The Kurdish language is an Indo-European …
Data Augmentation for Sorani Kurdish News Headline classification using back-translation and deep learning model
S Badawi - Kurdistan Journal of Applied Research, 2023 - spu.edu.iq
With the increase in the volume of news articles and headlines being generated, it is
becoming more difficult for individuals to keep up with the latest developments and find …