[HTML] Learning-free, divide and conquer text-line extraction algorithm for printed Arabic text with diacritics

A Qaroush, A Awad, A Hanani, K Mohammad… - Journal of King Saud …, 2022 - Elsevier
The extraction of text lines from document images is a critical step in optical character
recognition. It is still considered an open document analysis problem. The presence of …

GAN-based text line segmentation method for challenging handwritten documents

İ Özşeker, AA Demir, U Özkaya - International Journal on Document …, 2024 - Springer
Text line segmentation (TLS) is an essential step of the end-to-end document analysis
systems. The main purpose of this step is to extract the individual text lines of any …

Understanding unsupervised deep learning for text line segmentation

A Droby, B Kurar Barakat, R Saabni, R Alaasam… - Applied Sciences, 2022 - mdpi.com
We propose an unsupervised feature learning approach for segmenting text lines of
handwritten document images with no labelling effort. Humans can easily group local text …

Learning‐Based Ordering Characters on Ancient Document

H Lee, RH Baek, HC Choi - Computational Intelligence and …, 2022 - Wiley Online Library
Digitalizing and translating a scanned document image entails detecting the characters
using a detector and translating the characters in the order they were detected with a …

[PDF] Computational Qumranic Paleography

BK Barakat, N Dershowitz - 2023 - beratkurar.github.io
2 Objectives. The goal of this research project is to apply modern computer-vision tools to
analyze paleographic features of the handwriting of ancient fragmentary texts that are now …

[PDF] eScriptorium comparison

B Kurar-Barakat, N Dershowitz - 2022 - beratkurar.github.io
12.09.2022. eScriptorium is an open-source document image analysis platform with a web
interface that allows users to segment and transcribe document images [6]. The …

[PDF] A thousand word images are worth a word

BK Barakat, T Lu, A Dooms - 2021 - beratkurar.github.io
2.2 Objectives. Much of the literature in handwritten document image analysis has explored
different deep learning models. Researchers now know how to achieve baseline results using …