TrOCR Meets Language Models: An End-to-End Post-correction Approach
YH Chen, PB Ströbel - International Conference on Document Analysis …, 2024 - Springer
This study aims to enhance handwritten text recognition (HTR) performance and domain
adaptability by combining an optical character recognition (OCR) model with a language …
Touching text line segmentation combined local baseline and connected component for uchen Tibetan historical documents
P Hu, W Wang, Q Li, T Wang - Information Processing & Management, 2021 - Elsevier
The text lines of ancient Tibetan books are skewed and distorted, strokes are broken, and
complex adjacent text lines touch each other, which makes text line segmentation extremely …
Learning-free, divide and conquer text-line extraction algorithm for printed Arabic text with diacritics
The extraction of text lines from document images is a critical step in optical character
recognition. It is still considered an open document analysis problem. The presence of …
Script independent text segmentation of document images using graph network based shortest path scheme
Document image processing is one of the growing research fields in the digital world for
applications like data base indexing, text recognition, signature verification, web-searching …
Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks
FC Fizaine, P Bard, M Paindavoine, C Robin… - Journal of …, 2024 - mdpi.com
Text line segmentation is a necessary preliminary step before most text transcription
algorithms are applied. The leading deep learning networks used in this context (ARU-Net …
In Codice Ratio: A crowd-enabled solution for low resource machine transcription of the Vatican Registers
E Nieddu, D Firmani, P Merialdo, M Maiorino - Information Processing & …, 2021 - Elsevier
In Codice Ratio is a research project to study techniques for analyzing the contents
of historical documents conserved in the Vatican Apostolic Archives. In this paper, we …
Handwriting-Based Text Line Segmentation from Malayalam Documents
Featured Application: The proposed technique and the database created will be useful for
the development of an optical character recognition system for Malayalam handwritten …
Text line segmentation of offline malayalam handwritten document
AT Anju, BP Chacko, KP Basheer - AIP Conference Proceedings, 2024 - pubs.aip.org
Automatic handwriting recognition systems rely on text line segmentation. Text line
segmentation faces a number of obstacles, such as irregular text line gaps, slanted text …
Text Line Segmentation on Ancient Egyptian Papyri: Layout Analysis with Object Detection Networks and Connected Components
SM Unter - International Conference on Document Analysis and …, 2024 - Springer
The automatic localization of text lines is an important step in the analysis of handwritten
historical documents. It is a valuable tool for further analysis, such as studying handwriting …
Strokes Trajectory Recovery for Unconstrained Handwritten Documents with Automatic Evaluation.
S Hanif, LJ Latecki - ICPRAM, 2023 - cis.temple.edu
The focus of this paper is offline handwriting Stroke Trajectory Recovery (STR), which
facilitates the tasks such as handwriting recognition and synthesis. The input is an image of …