Nabra: Syrian Arabic Dialects with Morphological Annotations
This paper presents Nabra, a corpora of Syrian Arabic dialects with morphological
annotations. A team of Syrian natives collected more than 6K sentences containing about …
annotations. A team of Syrian natives collected more than 6K sentences containing about …
Lîsan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Corpora with Morphological Annotations
This article presents morphologically-annotated Yemeni, Sudanese, Iraqi, and Libyan Arabic
dialects (L̂isān) corpora. L̂isān features around 1.2 million tokens. We collected the …
dialects (L̂isān) corpora. L̂isān features around 1.2 million tokens. We collected the …
Chapter Introduction: Linguistic identities in the Arab Gulf states: Waves of change
The introductory chapter provides an overview of the book's main theme: linguistic identities
in the Arab Gulf States and waves of change. The introduction discusses the content of the …
in the Arab Gulf States and waves of change. The introduction discusses the content of the …
Morphological analysis and disambiguation for Gulf Arabic: The interplay between resources and methods
In this paper we present the first full morphological analysis and disambiguation system for
Gulf Arabic. We use an existing state-of-the-art morphological disambiguation system to …
Gulf Arabic. We use an existing state-of-the-art morphological disambiguation system to …
A little linguistics goes a long way: Unsupervised segmentation with limited language specific guidance
We present de-lexical segmentation, a linguistically motivated alternative to greedy or other
unsupervised methods, requiring only minimal language specific input. Our technique …
unsupervised methods, requiring only minimal language specific input. Our technique …
The Najdi Arabic Corpus: a new corpus for an underrepresented Arabic dialect
R Alhedayani - Language Resources and Evaluation, 2024 - Springer
This paper presents a new corpus for a dialect of Arabic spoken in the central region of
Saudi Arabia: the Najdi Arabic Corpus. This is the first publicly available corpus for this …
Saudi Arabia: the Najdi Arabic Corpus. This is the first publicly available corpus for this …
LexArabic: A receptive vocabulary size test to estimate Arabic proficiency
A Alzahrani - Behavior Research Methods, 2024 - Springer
Arabic is understudied in second-language research (L2) and lacks rapid and adequate
tools for measuring proficiency. Drawing inspiration from LexTALE and its extensions, this …
tools for measuring proficiency. Drawing inspiration from LexTALE and its extensions, this …
Is Arabic punctuation rule-governed?
This paper investigates the extent to which Arabic punctuation is rule-governed, with the aim
of improving text comprehension, disambiguation, and machine translation. The study …
of improving text comprehension, disambiguation, and machine translation. The study …
[HTML][HTML] Advancing AI-Driven Linguistic Analysis: Developing and Annotating Comprehensive Arabic Dialect Corpora for Gulf Countries and Saudi Arabia
This study harnesses the linguistic diversity of Arabic dialects to create two expansive
corpora from X (formerly Twitter). The Gulf Arabic Corpus (GAC-6) includes around 1.7 …
corpora from X (formerly Twitter). The Gulf Arabic Corpus (GAC-6) includes around 1.7 …
Maknuune: A Large Open Palestinian Arabic Lexicon
We present Maknuune, a large open lexicon for the Palestinian Arabic dialect. Maknuune
has over 36K entries from 17K lemmas, and 3.7 K roots. All entries include diacritized Arabic …
has over 36K entries from 17K lemmas, and 3.7 K roots. All entries include diacritized Arabic …