Handwritten optical character recognition (OCR): A comprehensive systematic literature review (SLR)

J Memon, M Sami, RA Khan, M Uddin - IEEE access, 2020 - ieeexplore.ieee.org
Given the ubiquity of handwritten documents in human transactions, Optical Character
Recognition (OCR) of documents have invaluable practical worth. Optical character …

Text recognition in the wild: A survey

X Chen, L Jin, Y Zhu, C Luo, T Wang - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …

Scene text recognition with permuted autoregressive sequence models

D Bautista, R Atienza - European conference on computer vision, 2022 - Springer
Context-aware STR methods typically use internal autoregressive (AR) language models
(LM). Inherent limitations of AR models motivated two-stage methods which employ an …

From two to one: A new scene text recognizer with visual language modeling network

Y Wang, H Xie, S Fang, J Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we abandon the dominant complex language model and rethink the linguistic
learning process in the scene text recognition. Different from previous methods considering …

Abcnet: Real-time scene text spotting with adaptive bezier-curve network

Y Liu, H Chen, C Shen, T He, L Jin… - proceedings of the …, 2020 - openaccess.thecvf.com
Scene text detection and recognition has received increasing research attention. Existing
methods can be roughly categorized into two groups: character-based and segmentation …

Text spotting transformers

X Zhang, Y Su, S Tripathi, Z Tu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
In this paper, we present TExt Spotting TRansformers (TESTR), a generic end-to-end text
spotting framework using Transformers for text detection and recognition in the wild. TESTR …

Turning a clip model into a scene text detector

W Yu, Y Liu, W Hua, D Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent large-scale Contrastive Language-Image Pretraining (CLIP) model has shown
great potential in various downstream tasks via leveraging the pretrained vision and …

Towards end-to-end unified scene text detection and layout analysis

S Long, S Qin, D Panteleev… - Proceedings of the …, 2022 - openaccess.thecvf.com
Scene text detection and document layout analysis have long been treated as two separate
tasks in different image domains. In this paper, we bring them together and introduce the …

Estextspotter: Towards better scene text spotting with explicit synergy in transformer

M Huang, J Zhang, D Peng, H Lu… - Proceedings of the …, 2023 - openaccess.thecvf.com
In recent years, end-to-end scene text spotting approaches are evolving to the Transformer-
based framework. While previous studies have shown the crucial importance of the intrinsic …

Abcnet v2: Adaptive bezier-curve network for real-time end-to-end text spotting

Y Liu, C Shen, L Jin, T He, P Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
End-to-end text-spotting, which aims to integrate detection and recognition in a unified
framework, has attracted increasing attention due to its simplicity of the two complimentary …