The effects of pre-processing techniques on Arabic text classification

A El Kah, I Zeroual - Int. J, 2021 - academia.edu
In the last two decades, the amount of available Arabic text data on the World Wide Web is
dramatically growing, making it the fourth most used language on the web. Accordingly, the …

Arabic information retrieval: Stemming or lemmatization?

I Zeroual, A Lakhouaja - 2017 Intelligent Systems and …, 2017 - ieeexplore.ieee.org
The Arabic language is expanding in the world. According to UNESCO, the Arabic language
is spoken by more than 422 million native speakers around 29 countries and among 1.6 …

The power of language music: Arabic lemmatization through patterns

M Attia, A Zirikly, M Diab - Proceedings of the 5th Workshop on …, 2016 - aclanthology.org
The interaction between roots and patterns in Arabic has intrigued lexicographers and
morphologists for centuries. While roots provide the consonantal building blocks, patterns …

Deep learning approach for automatic Romanian lemmatization

M Nuţu - Procedia Computer Science, 2021 - Elsevier
This paper proposes a deep learning sequence-to-sequence approach to improve the task
of automatic Romanian lemmatization. The study compares 24 systems using different …

Improving the Accessibility of Arabic Electronic Theses and Dissertations (ETDs) with Metadata and Classification

E Abdelrahman - 2021 - vtechworks.lib.vt.edu
Much research work has been done to extract data from scientific papers, journals, and
articles. However, Electronic Theses and Dissertations (ETDs) remain an unexplored genre …

Machine Learning based Solutions for Text Processing and Speech Synthesis

M Nuţu - 2024 - ebooks.unitbv.ro

Otrouha: Automatic Classification of Arabic ETDs

E Abdelrahman, F Alotaibi - vtechworks.lib.vt.edu
ETDs are becoming a new genre of documents that is highly precious and worthy of
preservation. This has resulted in a sustainable need to build an effective tool to facilitate …

Building Arabic Corpora: Concepts, Methodologies, Tools, and Experiments

I Zeroual - 2018 - researchgate.net
The term corpus comes from Latin and means “body”. According to corpus linguists, a
corpus can be defined as a collection of machine-readable authentic texts, including …