A survey on Arabic named entity recognition: Past, recent advances, and future trends

X Qu, Y Gu, Q Xia, Z Li, Z Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
As more and more Arabic texts emerged on the Internet, extracting important information
from these Arabic texts is especially useful. As a fundamental technology, Named entity …

A survey on the state-of-the-art machine learning models in the context of NLP

W Khan, A Daud, JA Nasir, T Amjad - Kuwait Journal of Science, 2016 - journalskuwait.org
Kuwait J. Sci. 43 (4), pp. 95-113, 2016. A survey on the state-of-the-art machine learning
models in the context of NLP. Wahab Khan, Ali Daud …

The interplay of variant, size, and task type in Arabic pre-trained language models

G Inoue, B Alhafni, N Baimukan, H Bouamor… - arXiv preprint arXiv …, 2021 - arxiv.org
In this paper, we explore the effects of language variants, data sizes, and fine-tuning task
types in Arabic pre-trained language models. To do so, we build three pre-trained language …

Jais and Jais-chat: Arabic-centric foundation and instruction-tuned open generative large language models

N Sengupta, SK Sahu, B Jia, S Katipomu, H Li… - arXiv preprint arXiv …, 2023 - arxiv.org
We introduce Jais and Jais-chat, new state-of-the-art Arabic-centric foundation and
instruction-tuned open generative large language models (LLMs). The models are based on …

Adapting pre-trained language models to African languages via multilingual adaptive fine-tuning

JO Alabi, DI Adelani, M Mosbach, D Klakow - arXiv preprint arXiv …, 2022 - arxiv.org
Multilingual pre-trained language models (PLMs) have demonstrated impressive
performance on several downstream tasks for both high-resourced and low-resourced …

CAMeL Tools: An open source Python toolkit for Arabic natural language processing

O Obeid, N Zalmout, S Khalifa, D Taji… - Proceedings of the …, 2020 - aclanthology.org
We present CAMeL Tools, a collection of open-source tools for Arabic natural
language processing in Python. CAMeL Tools currently provides utilities for pre-processing …

Having beer after prayer? Measuring cultural bias in large language models

T Naous, MJ Ryan, A Ritter, W Xu - arXiv preprint arXiv:2305.14456, 2023 - arxiv.org
As the reach of large language models (LMs) expands globally, their ability to cater to
diverse cultural contexts becomes crucial. Despite advancements in multilingual …

AraELECTRA: Pre-training text discriminators for Arabic language understanding

W Antoun, F Baly, H Hajj - arXiv preprint arXiv:2012.15516, 2020 - arxiv.org
Advances in English language representation enabled a more sample-efficient pre-training
task by Efficiently Learning an Encoder that Classifies Token Replacements Accurately …

Wojood: Nested Arabic named entity corpus and recognition using BERT

M Jarrar, M Khalilia, S Ghanem - arXiv preprint arXiv:2205.09651, 2022 - arxiv.org
This paper presents Wojood, a corpus for Arabic nested Named Entity Recognition (NER).
Nested entities occur when one entity mention is embedded inside another entity mention …

A survey of Arabic named entity recognition and classification

K Shaalan - Computational Linguistics, 2014 - direct.mit.edu
As more and more Arabic textual information becomes available through the Web in homes
and businesses, via Internet and Intranet services, there is an urgent need for technologies …