Seed: Semantics enhanced encoder-decoder framework for scene text recognition

Z Qiao, Y Zhou, D Yang, Y Zhou… - Proceedings of the …, 2020 - openaccess.thecvf.com
Scene text recognition is a hot research topic in computer vision. Recently, many recognition
methods based on the encoder-decoder framework have been proposed, and they can …

Pimnet: a parallel, iterative and mimicking network for scene text recognition

Z Qiao, Y Zhou, J Wei, W Wang, Y Zhang… - Proceedings of the 29th …, 2021 - dl.acm.org
Nowadays, scene text recognition has attracted more and more attention due to its various
applications. Most state-of-the-art methods adopt an encoder-decoder framework with …

Dense semantic contrast for self-supervised visual representation learning

X Li, Y Zhou, Y Zhang, A Zhang, W Wang… - Proceedings of the 29th …, 2021 - dl.acm.org
Self-supervised representation learning for visual pre-training has achieved remarkable
success with sample (instance or pixel) discrimination and semantics discovery of instance …

Beyond ocr+ vqa: involving ocr into the flow for robust and accurate textvqa

G Zeng, Y Zhang, Y Zhou, X Yang - Proceedings of the 29th ACM …, 2021 - dl.acm.org
Text-based visual question answering (TextVQA) requires analyzing both the visual contents
and texts in an image to answer a question, which is more practical than general visual …

Mask is all you need: Rethinking mask R-CNN for dense and arbitrary-shaped scene text detection

X Qin, Y Zhou, Y Guo, D Wu, Z Tian, N Jiang… - Proceedings of the 29th …, 2021 - dl.acm.org
Due to the large success in object detection and instance segmentation, Mask R-CNN
attracts great attention and is widely adopted as a strong baseline for arbitrary-shaped …

Towards robust real-time scene text detection: From semantic to instance representation learning

X Qin, P Lyu, C Zhang, Y Zhou, K Yao… - Proceedings of the 31st …, 2023 - dl.acm.org
Due to the flexible representation of arbitrary-shaped scene text and simple pipeline, bottom-
up segmentation-based methods begin to be mainstream in real-time scene text detection …

Textblock: Towards scene text spotting without fine-grained detection

J Wei, Y Zhang, Y Zhou, G Zeng, Z Qiao… - Proceedings of the 30th …, 2022 - dl.acm.org
Scene text spotting systems which integrate text detection and recognition modules have
witnessed a lot of success in recent years. Existing works mostly follow the framework of …

Tpsnet: Reverse thinking of thin plate splines for arbitrary shape scene text representation

W Wang, Y Zhou, J Lv, D Wu, G Zhao, N Jiang… - Proceedings of the 30th …, 2022 - dl.acm.org
The research focus of scene text detection and recognition has shifted to arbitrary shape text
in recent years, where the text shape representation is a fundamental problem. An ideal …

RD-IOD: Two-level residual-distillation-based triple-network for incremental object detection

D Yang, Y Zhou, W Shi, D Wu, W Wang - ACM Transactions on …, 2022 - dl.acm.org
As a basic component in multimedia applications, object detectors are generally trained on a
fixed set of classes that are pre-defined. However, new object classes often emerge after the …

Text growing on leaf

C Yang, M Chen, Y Yuan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Irregular-shaped texts bring challenges to Scene Text Detection (STD). Although existing
regression-based approaches achieve comparable performances, they fail to cover some …