Pyramid vision transformer: A versatile backbone for dense prediction without convolutions

W Wang, E Xie, X Li, DP Fan, K Song… - Proceedings of the …, 2021 - openaccess.thecvf.com
Although convolutional neural networks (CNNs) have achieved great success in computer
vision, this work investigates a simpler, convolution-free backbone network useful for many …

MPDIoU: A loss for efficient and accurate bounding box regression

S Ma, Y Xu - arXiv preprint arXiv:2307.07662, 2023 - arxiv.org
Bounding box regression (BBR) has been widely used in object detection and instance
segmentation, which is an important step in object localization. However, most of the existing …

DeepSolo: Let transformer decoder with explicit points solo for text spotting

M Ye, J Zhang, S Zhao, J Liu, T Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
End-to-end text spotting aims to integrate scene text detection and recognition into a unified
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …

SwinTextSpotter: Scene text spotting via better synergy between text detection and text recognition

M Huang, Y Liu, Z Peng, C Liu, D Lin… - Proceedings of the …, 2022 - openaccess.thecvf.com
End-to-end scene text spotting has attracted great attention in recent years due to the
success of excavating the intrinsic synergy of the scene text detection and recognition …

ESTextSpotter: Towards better scene text spotting with explicit synergy in transformer

M Huang, J Zhang, D Peng, H Lu… - Proceedings of the …, 2023 - openaccess.thecvf.com
In recent years, end-to-end scene text spotting approaches are evolving to the Transformer-
based framework. While previous studies have shown the crucial importance of the intrinsic …

ABINet++: Autonomous, bidirectional and iterative language modeling for scene text spotting

S Fang, Z Mao, H Xie, Y Wang, C Yan… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Scene text spotting is of great importance to the computer vision community due to its wide
variety of applications. Recent methods attempt to introduce linguistic knowledge for …

On the arbitrary-oriented object detection: Classification based approaches revisited

X Yang, J Yan - International Journal of Computer Vision, 2022 - Springer
Arbitrary-oriented object detection has been a building block for rotation sensitive tasks. We
first show that the boundary problem suffered in existing dominant regression-based rotation …

SPTS v2: Single-point scene text spotting

Y Liu, J Zhang, D Peng, M Huang… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
End-to-end scene text spotting has made significant progress due to its intrinsic synergy
between text detection and recognition. Previous methods commonly regard manual …

Weakly supervised scene text generation for low-resource languages

Y Xie, X Chen, H Zhan, P Shivakumara, B Yin… - Expert Systems with …, 2024 - Elsevier
A large number of annotated training images is crucial for training successful scene text
recognition models. However, collecting sufficient datasets can be a labor-intensive and …

A decade: Review of scene text detection methods

E Rainarli - Computer Science Review, 2021 - Elsevier
The rapid development of scene text detection shows us the need for text recognition in a
scene image. Road signs recognition, reading the scene image for machine translation, text …