TrOCR Meets Language Models: An End-to-End Post-correction Approach

YH Chen, PB Ströbel - International Conference on Document Analysis …, 2024 - Springer
This study aims to enhance handwritten text recognition (HTR) performance and domain
adaptability by combining an optical character recognition (OCR) model with a language …

Touching text line segmentation combined local baseline and connected component for uchen Tibetan historical documents

P Hu, W Wang, Q Li, T Wang - Information Processing & Management, 2021 - Elsevier
The text lines of ancient Tibetan books are skewed and distorted, strokes are broken, and
complex adjacent text lines touch each other, which makes text line segmentation extremely …

[HTML][HTML] Learning-free, divide and conquer text-line extraction algorithm for printed Arabic text with diacritics

A Qaroush, A Awad, A Hanani, K Mohammad… - Journal of King Saud …, 2022 - Elsevier
The extraction of text lines from document images is a critical step in optical character
recognition. It is still considered an open document analysis problem. The presence of …

Script independent text segmentation of document images using graph network based shortest path scheme

P Sahare, JV Tembhurne, MR Parate, T Diwan… - International Journal of …, 2023 - Springer
Document image processing is one of the growing research fields in the digital world for
applications like data base indexing, text recognition, signature verification, web-searching …

Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks

FC Fizaine, P Bard, M Paindavoine, C Robin… - Journal of …, 2024 - mdpi.com
Text line segmentation is a necessary preliminary step before most text transcription
algorithms are applied. The leading deep learning networks used in this context (ARU-Net …

In Codice Ratio: A crowd-enabled solution for low resource machine transcription of the Vatican Registers

E Nieddu, D Firmani, P Merialdo, M Maiorino - Information Processing & …, 2021 - Elsevier
Abstract In Codice Ratio is a research project to study techniques for analyzing the contents
of historical documents conserved in the Vatican Apostolic Archives. In this paper, we …

Handwriting-Based Text Line Segmentation from Malayalam Documents

P PV, D Sankar - Applied Sciences, 2023 - mdpi.com
Featured Application The proposed technique and the database created will be useful for
the development of an optical character recognition system for Malayalam handwritten …

Text line segmentation of offline malayalam handwritten document

AT Anju, BP Chacko, KP Basheer - AIP Conference Proceedings, 2024 - pubs.aip.org
Automatic handwriting recognition systems rely on text line segmentation. Text line
segmentation faces a number of obstacles, such as irregular text line gaps, slanted text …

Text Line Segmentation on Ancient Egyptian Papyri: Layout Analysis with Object Detection Networks and Connected Components

SM Unter - International Conference on Document Analysis and …, 2024 - Springer
The automatic localization of text lines is an important step in the analysis of handwritten
historical documents. It is a valuable tool for further analysis, such as studying handwriting …

[PDF][PDF] Strokes Trajectory Recovery for Unconstrained Handwritten Documents with Automatic Evaluation.

S Hanif, LJ Latecki - ICPRAM, 2023 - cis.temple.edu
The focus of this paper is offline handwriting Stroke Trajectory Recovery (STR), which
facilitates the tasks such as handwriting recognition and synthesis. The input is an image of …