Survey of post-OCR processing approaches

TTH Nguyen, A Jatowt, M Coustaty… - ACM Computing Surveys …, 2021 - dl.acm.org
Optical character recognition (OCR) is one of the most popular techniques used for
converting printed documents into machine-readable ones. While OCR engines can do well …

Survey of automatic spelling correction

D Hládek, J Staš, M Pleva - Electronics, 2020 - mdpi.com
Automatic spelling correction has been receiving sustained research attention. Although
each article contains a brief introduction to the topic, there is a lack of work that would …

Enhancement of license plate recognition performance using Xception with Mish activation function

A Pattanaik, RC Balabantaray - Multimedia Tools and Applications, 2023 - Springer
The current breakthroughs in the highway research sector have resulted in a greater
awareness and focus on the construction of an effective Intelligent Transportation System …

A survey of mono- and multi-lingual character recognition using deep and shallow architectures: Indic and non-Indic scripts

S Kaur, S Bawa, R Kumar - Artificial Intelligence Review, 2020 - Springer
The cultural and regional diversity across the world and specifically in India has given birth
to a large number of writing systems and scripts having a variety of character sets. For scripts …

Searching the PDF Haystack: automated knowledge discovery in scanned EHR documents

AL Kostrinsky-Thomas, FM Hisama… - Applied Clinical …, 2021 - thieme-connect.com
Background: Clinicians express concern that they may be unaware of important information
contained in voluminous scanned and other outside documents contained in electronic …

Reproducible research in document analysis and recognition

JR Fonseca Cacho, K Taghva - Information Technology-New Generations …, 2018 - Springer
With reproducible research becoming a de facto standard in computational sciences, many
approaches have been explored to enable researchers in other disciplines to adopt this …

Data and Process Quality Evaluation in a Textual Big Data Archiving System

M Fugini, J Finocchi - ACM Journal on Computing and Cultural Heritage …, 2022 - dl.acm.org
The article presents a textual Big Data analytics solution developed in a real setting as a part
of a high-capacity document digitization and storage system. A software based on machine …

Spelling correction of OCR-generated Hindi text using word embedding and Levenshtein distance

S Srigiri, SK Saha - … , Circuits and Communication Systems: Proceeding of …, 2020 - Springer
Optical Character Recognition (OCR) systems for Indian languages including Hindi
often suffer from poor accuracy due to the wide character variety, compound characters …

Improving OCR post processing with machine learning tools

JRF Cacho - 2019 - search.proquest.com
Optical Character Recognition (OCR) Post Processing involves data cleaning steps
for documents that were digitized, such as a book or a newspaper article. One step in this …