Nabra: Syrian Arabic Dialects with Morphological Annotations

A Nayouf, T Hammouda, M Jarrar, F Zaraket… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper presents Nabra, a corpora of Syrian Arabic dialects with morphological
annotations. A team of Syrian natives collected more than 6K sentences containing about …

Lîsan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Corpora with Morphological Annotations

M Jarrar, FA Zaraket, T Hammouda… - 2023 20th ACS/IEEE …, 2023 - ieeexplore.ieee.org
This article presents morphologically-annotated Yemeni, Sudanese, Iraqi, and Libyan Arabic
dialects (L̂isān) corpora. L̂isān features around 1.2 million tokens. We collected the …

Chapter Introduction: Linguistic identities in the Arab Gulf states: Waves of change

S Hopkyns, W Zoghbor - Linguistic Identities in the Arab Gulf …, 2022 - library.oapen.org
The introductory chapter provides an overview of the book's main theme: linguistic identities
in the Arab Gulf States and waves of change. The introduction discusses the content of the …

Morphological analysis and disambiguation for Gulf Arabic: The interplay between resources and methods

S Khalifa, N Zalmout, N Habash - Proceedings of the Twelfth …, 2020 - aclanthology.org
In this paper we present the first full morphological analysis and disambiguation system for
Gulf Arabic. We use an existing state-of-the-art morphological disambiguation system to …

A little linguistics goes a long way: Unsupervised segmentation with limited language specific guidance

A Erdmann, S Khalifa, M Oudah… - Proceedings of the …, 2019 - aclanthology.org
We present de-lexical segmentation, a linguistically motivated alternative to greedy or other
unsupervised methods, requiring only minimal language specific input. Our technique …

The Najdi Arabic Corpus: a new corpus for an underrepresented Arabic dialect

R Alhedayani - Language Resources and Evaluation, 2024 - Springer
This paper presents a new corpus for a dialect of Arabic spoken in the central region of
Saudi Arabia: the Najdi Arabic Corpus. This is the first publicly available corpus for this …

LexArabic: A receptive vocabulary size test to estimate Arabic proficiency

A Alzahrani - Behavior Research Methods, 2024 - Springer
Arabic is understudied in second-language research (L2) and lacks rapid and adequate
tools for measuring proficiency. Drawing inspiration from LexTALE and its extensions, this …

Is Arabic punctuation rule-governed?

S Yagi, S Fareh, A Elnagar, M Balajeed… - Cogent Arts & …, 2024 - Taylor & Francis
This paper investigates the extent to which Arabic punctuation is rule-governed, with the aim
of improving text comprehension, disambiguation, and machine translation. The study …

[HTML][HTML] Advancing AI-Driven Linguistic Analysis: Developing and Annotating Comprehensive Arabic Dialect Corpora for Gulf Countries and Saudi Arabia

N Al-Shenaifi, AM Azmi, M Hosny - Mathematics, 2024 - mdpi.com
This study harnesses the linguistic diversity of Arabic dialects to create two expansive
corpora from X (formerly Twitter). The Gulf Arabic Corpus (GAC-6) includes around 1.7 …

Maknuune: A Large Open Palestinian Arabic Lexicon

S Dibas, C Khairallah, N Habash, OF Sadi… - arXiv preprint arXiv …, 2022 - arxiv.org
We present Maknuune, a large open lexicon for the Palestinian Arabic dialect. Maknuune
has over 36K entries from 17K lemmas, and 3.7 K roots. All entries include diacritized Arabic …