CAMeL Tools: An open source Python toolkit for Arabic natural language processing

O Obeid, N Zalmout, S Khalifa, D Taji… - Proceedings of the …, 2020 - aclanthology.org
We present CAMeL Tools, a collection of open-source tools for Arabic natural
language processing in Python. CAMeL Tools currently provides utilities for pre-processing …

A panoramic survey of natural language processing in the Arab world

K Darwish, N Habash, M Abbas, H Al-Khalifa… - Communications of the …, 2021 - dl.acm.org
The term natural language refers to any system of symbolic communication (spoken,
signed, or written) that has evolved naturally in humans without intentional human planning …

UniMorph 4.0: universal morphology

K Batsuren, O Goldman, S Khalifa, N Habash… - arXiv preprint arXiv …, 2022 - arxiv.org
The Universal Morphology (UniMorph) project is a collaborative effort providing broad-
coverage instantiated normalized morphological inflection tables for hundreds of diverse …

Gender-aware reinflection using linguistically enhanced neural models

B Alhafni, N Habash, H Bouamor - Proceedings of the Second …, 2020 - aclanthology.org
In this paper, we present an approach for sentence-level gender reinflection using
linguistically enhanced sequence-to-sequence models. Our system takes an Arabic …

Camelira: An Arabic multi-dialect morphological disambiguator

O Obeid, G Inoue, N Habash - arXiv preprint arXiv:2211.16807, 2022 - arxiv.org
We present Camelira, a web-based Arabic multi-dialect morphological disambiguation tool
that covers four major variants of Arabic: Modern Standard Arabic, Egyptian, Gulf, and …

Automatic error type annotation for Arabic

R Belkebir, N Habash - arXiv preprint arXiv:2109.08068, 2021 - arxiv.org
We present ARETA, an automatic error type annotation system for Modern Standard Arabic.
We design ARETA to address Arabic's morphological richness and orthographic ambiguity …

ALMA: Fast Lemmatizer and POS Tagger for Arabic

M Jarrar, D Akra, T Hammouda - Procedia Computer Science, 2024 - Elsevier
We introduce Alma, an open-source and state-of-the-art lemmatizer, POS tagger, and root
tagger for Arabic, boasting both high speed and accuracy. Alma relies on a dictionary of …

Morphotactic modeling in an open-source multi-dialectal Arabic morphological analyzer and generator

N Habash, R Marzouk, C Khairallah… - Proceedings of the 19th …, 2022 - aclanthology.org
Arabic is a morphologically rich and complex language, with numerous dialectal variants.
Previous efforts on Arabic morphology modeling focused on specific variants and specific …

Recent advancements in computational morphology: A comprehensive survey

J Baxi, B Bhatt - arXiv preprint arXiv:2406.05424, 2024 - arxiv.org
Computational morphology handles the language processing at the word level. It is one of
the foundational tasks in the NLP pipeline for the development of higher level NLP …

Arabic word-level readability visualization for assisted text simplification

R Hazim, H Saddiki, B Alhafni, MA Khalil… - arXiv preprint arXiv …, 2022 - arxiv.org
This demo paper presents a Google Docs add-on for automatic Arabic word-level readability
visualization. The add-on includes a lemmatization component that is connected to a five …