Enhancing optical character recognition: Efficient techniques for document layout analysis and text line detection

A Fateh, M Fateh, V Abolghasemi - Engineering Reports, 2024 - Wiley Online Library
In recent years, automatic document and text analysis has gained significant importance,
driven by advancements in optical character recognition (OCR) technology and the need for …

Persian printed text line detection based on font size

A Fateh, M Rezvani, A Tajary, M Fateh - Multimedia Tools and …, 2023 - Springer
Text line segmentation is an essential step in the process of converting document images
into text. In OCR systems, text line segmentation affects the character segmentation stage …

[PDF][PDF] 基于自适应游程平滑算法的藏文文档图像版面分割与描述

陈园园, 王维兰, 刘华明, 蔡正琦… - Laser & Optoelectronics …, 2021 - researching.cn
摘要版面分割是文档图像分析与识别过程中的重要基础步骤, 为了探索适用于藏文文档图像版面
分割与描述的方法, 提出一种基于自适应游程平滑算法的研究方法. 根据藏文文档图像的版面 …

Historical document image analysis: a structural approach based on texture

M Mehri - 2015 - theses.hal.science
Over the last few years, there has been tremendous growth in digitizing collections of
cultural heritage documents. Thus, many challenges and open issues have been raised …

Text Line Detection and Correction for Challenging Datasets: A Case Study with Newspapers Dataset

A Fateh, V Abolghasemi - … Text Line Detection and Correction for …, 2023 - papers.ssrn.com
Abstract Official Iranian Newspapers (OIN) are those in which companies' registration
information is published. To digitally store the data of these newspapers, we need to convert …

[图书][B] HOLMES: A Hybrid Ontology-Learning Materials Engineering System

MFM Remolona - 2018 - search.proquest.com
Designing and discovering novel materials is challenging problem in many domains such as
fuel additives, composites, pharmaceuticals, and so on. At the core of all this are models that …

Data extraction from scanned invoice documents in multiple languages

N Aggarwal, S Patra, S Sinha… - … Workshop on Signal …, 2023 - spiedigitallibrary.org
This work provides an open-source method for extracting rel-evant information from scanned
documents, such as bills, bank accounts, and invoices. The solution supports documents in …