Kurdish Fake News Detection Based on Machine Learning Approaches

DA Salh, RM Nabi - Passer journal of basic and applied …, 2023 - passer.garmian.edu.krd
The widespread use of social media platforms and the internet has increased information
sharing, including both true and false news. Detecting fake news is challenging, and several …

[HTML][HTML] Kurdish news dataset headlines (KNDH) through multiclass classification

S Badawi, AM Saeed, SA Ahmed, PA Abdalla… - Data in brief, 2023 - Elsevier
The rapid growth of technology has massively increased the amount of text data. The data
can be mined and utilized for numerous natural language processing (NLP) tasks …

A language model for spell checking of educational texts in Kurdish (Sorani)

R Abdulrahman, H Hassani - … of the 1st Annual Meeting of the …, 2022 - aclanthology.org
Spell checkers are an integrated feature of most software applications handling text inputs.
When we write an email or compile a report on a desktop or a smartphone editor, a spell …

[HTML][HTML] KurdSum: A new benchmark dataset for the Kurdish text summarization

S Badawi - Natural Language Processing Journal, 2023 - Elsevier
Summarizing a text is the process of condensing its content while still maintaining its
essential information. With the abundance of digital information available, summarization …

Approaches to corpus creation for low-resource language technology: the case of Southern Kurdish and Laki

S Ahmadi, Z Azin, S Belelli… - arXiv preprint arXiv …, 2023 - arxiv.org
One of the major challenges that under-represented and endangered language
communities face in language technology is the lack or paucity of language data. This is …

CODET: A benchmark for contrastive dialectal evaluation of machine translation

MMI Alam, S Ahmadi, A Anastasopoulos - arXiv preprint arXiv:2305.17267, 2023 - arxiv.org
Neural machine translation (NMT) systems exhibit limited robustness in handling source-
side linguistic variations. Their performance tends to degrade when faced with even slight …

Transfer learning for low-resource sentiment analysis

R Hameed, S Ahmadi, F Daneshfar - arXiv preprint arXiv:2304.04703, 2023 - arxiv.org
Sentiment analysis is the process of identifying and extracting subjective information from
text. Despite the advances to employ cross-lingual approaches in an automatic way, the …

Using multilingual bidirectional encoder representations from transformers on medical corpus for Kurdish text classification

SS Badawi - Aro-The scientific Journal of Koya University, 2023 - aro.koyauniversity.org
Technology has dominated a huge part of human life. Furthermore, technology users use
language continuously to express feelings and sentiments about things. The science behind …

Jira: a Central Kurdish speech recognition system, designing and building speech corpus and pronunciation lexicon

H Veisi, H Hosseini, M MohammadAmini… - Language Resources …, 2022 - Springer
This paper introduces the first large vocabulary speech recognition system (LVSR) for the
Central Kurdish language, named Jira. The Kurdish language is an Indo-European …

Data Augmentation for Sorani kurdish News Headline classification using back-translation and deep learning model

S Badawi - Kurdistan Journal of Applied Research, 2023 - spu.edu.iq
With the increase in the volume of news articles and headlines being generated, it is
becoming more difficult for individuals to keep up with the latest developments and find …