A survey of OCR in Arabic language: applications, techniques, and challenges

S Faizullah, MS Ayub, S Hussain, MA Khan - Applied Sciences, 2023 - mdpi.com
Optical character recognition (OCR) is the process of extracting handwritten or printed text
from a scanned or printed image and converting it to a machine-readable form for further …

A survey on Arabic character segmentation

YM Alginahi - International Journal on Document Analysis and …, 2013 - Springer
Arabic character segmentation is a necessary step in Arabic Optical Character Recognition
(OCR). The cursive nature of Arabic script poses challenging problems in Arabic character …

Handwritten Urdu character recognition using one-dimensional BLSTM classifier

SB Ahmed, S Naz, S Swati, MI Razzak - Neural Computing and …, 2019 - Springer
The recognition of cursive script is regarded as a subtle task in optical character recognition
due to its varied representation. Every cursive script has different nature and associated …

A new arabic printed text image database and evaluation protocols

F Slimane, R Ingold, S Kanoun… - 2009 10th …, 2009 - ieeexplore.ieee.org
We report on the creation of a database composed of images of Arabic Printed words. The
purpose of this database is the large-scale benchmarking of open-vocabulary, multi-font …

A study on font-family and font-size recognition applied to Arabic word images at ultra-low resolution

F Slimane, S Kanoun, J Hennebert, AM Alimi… - Pattern Recognition …, 2013 - Elsevier
In this paper, we propose a new font and size identification method for ultra-low resolution
Arabic word images using a stochastic approach. The literature has proved the difficulty for …

Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey

MSE Kasem, M Mahmoud, HS Kang - arXiv preprint arXiv:2312.11812, 2023 - arxiv.org
Optical character recognition (OCR) is a vital process that involves the extraction of
handwritten or printed text from scanned or printed images, converting it into a format that …

IESK-ArDB: a database for handwritten Arabic and an optimized topological segmentation approach

M Elzobi, A Al-Hamadi, Z Al Aghbari… - International Journal on …, 2013 - Springer
Even though a lot of researches have been conducted in order to solve the problem of
unconstrained handwriting recognition, an effective solution is still a serious challenge. In …

Ocformer: A transformer-based model for arabic handwritten text recognition

A Mostafa, O Mohamed, A Ashraf… - 2021 International …, 2021 - ieeexplore.ieee.org
The Optical Character Recognition (OCR) of Arabic historical documents is a challenging
task. The reason being the complexity of the layout and the highly variant typography …

A novel Arabic OCR post-processing using rule-based and word context techniques

IA Doush, F Alkhateeb, AH Gharaibeh - International Journal on Document …, 2018 - Springer
Optical character recognition (OCR) is the process of recognizing characters automatically
from scanned documents for editing, indexing, searching, and reducing the storage space …

ICDAR 2011-arabic recognition competition: Multi-font multi-size digitally represented text

F Slimane, S Kanoun, H El Abed… - 2011 International …, 2011 - ieeexplore.ieee.org
This paper describes the Arabic Recognition Competition: Multi-font Multi-size Digitally
Represented Text held in the context of the 11 ^th International Conference on Document …