Pyramid vision transformer: A versatile backbone for dense prediction without convolutions
Although convolutional neural networks (CNNs) have achieved great success in computer
vision, this work investigates a simpler, convolution-free backbone network useful for many …
vision, this work investigates a simpler, convolution-free backbone network useful for many …
MPDIoU: a loss for efficient and accurate bounding box regression
S Ma, Y Xu - arXiv preprint arXiv:2307.07662, 2023 - arxiv.org
Bounding box regression (BBR) has been widely used in object detection and instance
segmentation, which is an important step in object localization. However, most of the existing …
segmentation, which is an important step in object localization. However, most of the existing …
Deepsolo: Let transformer decoder with explicit points solo for text spotting
End-to-end text spotting aims to integrate scene text detection and recognition into a unified
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …
Swintextspotter: Scene text spotting via better synergy between text detection and text recognition
End-to-end scene text spotting has attracted great attention in recent years due to the
success of excavating the intrinsic synergy of the scene text detection and recognition …
success of excavating the intrinsic synergy of the scene text detection and recognition …
Estextspotter: Towards better scene text spotting with explicit synergy in transformer
In recent years, end-to-end scene text spotting approaches are evolving to the Transformer-
based framework. While previous studies have shown the crucial importance of the intrinsic …
based framework. While previous studies have shown the crucial importance of the intrinsic …
Abinet++: Autonomous, bidirectional and iterative language modeling for scene text spotting
Scene text spotting is of great importance to the computer vision community due to its wide
variety of applications. Recent methods attempt to introduce linguistic knowledge for …
variety of applications. Recent methods attempt to introduce linguistic knowledge for …
On the arbitrary-oriented object detection: Classification based approaches revisited
Arbitrary-oriented object detection has been a building block for rotation sensitive tasks. We
first show that the boundary problem suffered in existing dominant regression-based rotation …
first show that the boundary problem suffered in existing dominant regression-based rotation …
Spts v2: single-point scene text spotting
End-to-end scene text spotting has made significant progress due to its intrinsic synergy
between text detection and recognition. Previous methods commonly regard manual …
between text detection and recognition. Previous methods commonly regard manual …
Weakly supervised scene text generation for low-resource languages
A large number of annotated training images is crucial for training successful scene text
recognition models. However, collecting sufficient datasets can be a labor-intensive and …
recognition models. However, collecting sufficient datasets can be a labor-intensive and …
A decade: review of scene text detection methods
E Rainarli - Computer Science Review, 2021 - Elsevier
The rapid development of scene text detection shows us the need for text recognition in a
scene image. Road signs recognition, reading the scene image for machine translation, text …
scene image. Road signs recognition, reading the scene image for machine translation, text …