Handwritten optical character recognition (OCR): A comprehensive systematic literature review (SLR)
Given the ubiquity of handwritten documents in human transactions, Optical Character
Recognition (OCR) of documents have invaluable practical worth. Optical character …
Recognition (OCR) of documents have invaluable practical worth. Optical character …
Text recognition in the wild: A survey
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …
information carried by text is important in a wide range of vision-based application …
Scene text recognition with permuted autoregressive sequence models
D Bautista, R Atienza - European conference on computer vision, 2022 - Springer
Context-aware STR methods typically use internal autoregressive (AR) language models
(LM). Inherent limitations of AR models motivated two-stage methods which employ an …
(LM). Inherent limitations of AR models motivated two-stage methods which employ an …
From two to one: A new scene text recognizer with visual language modeling network
In this paper, we abandon the dominant complex language model and rethink the linguistic
learning process in the scene text recognition. Different from previous methods considering …
learning process in the scene text recognition. Different from previous methods considering …
Abcnet: Real-time scene text spotting with adaptive bezier-curve network
Scene text detection and recognition has received increasing research attention. Existing
methods can be roughly categorized into two groups: character-based and segmentation …
methods can be roughly categorized into two groups: character-based and segmentation …
Text spotting transformers
In this paper, we present TExt Spotting TRansformers (TESTR), a generic end-to-end text
spotting framework using Transformers for text detection and recognition in the wild. TESTR …
spotting framework using Transformers for text detection and recognition in the wild. TESTR …
Turning a clip model into a scene text detector
The recent large-scale Contrastive Language-Image Pretraining (CLIP) model has shown
great potential in various downstream tasks via leveraging the pretrained vision and …
great potential in various downstream tasks via leveraging the pretrained vision and …
Towards end-to-end unified scene text detection and layout analysis
Scene text detection and document layout analysis have long been treated as two separate
tasks in different image domains. In this paper, we bring them together and introduce the …
tasks in different image domains. In this paper, we bring them together and introduce the …
Estextspotter: Towards better scene text spotting with explicit synergy in transformer
In recent years, end-to-end scene text spotting approaches are evolving to the Transformer-
based framework. While previous studies have shown the crucial importance of the intrinsic …
based framework. While previous studies have shown the crucial importance of the intrinsic …
Abcnet v2: Adaptive bezier-curve network for real-time end-to-end text spotting
End-to-end text-spotting, which aims to integrate detection and recognition in a unified
framework, has attracted increasing attention due to its simplicity of the two complimentary …
framework, has attracted increasing attention due to its simplicity of the two complimentary …