Text recognition in the wild: A survey

X Chen, L Jin, Y Zhu, C Luo, T Wang - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …

Breast tumor localization and segmentation using machine learning techniques: Overview of datasets, findings, and methods

R Ranjbarzadeh, S Dorosti, SJ Ghoushchi… - Computers in Biology …, 2023 - Elsevier
Abstract The Global Cancer Statistics 2020 reported breast cancer (BC) as the most
common diagnosis of cancer type. Therefore, early detection of such type of cancer would …

A survey on multimodal large language models

S Yin, C Fu, S Zhao, K Li, X Sun, T Xu… - arXiv preprint arXiv …, 2023 - arxiv.org
Multimodal Large Language Model (MLLM) recently has been a new rising research
hotspot, which uses powerful Large Language Models (LLMs) as a brain to perform …

Lvlm-ehub: A comprehensive evaluation benchmark for large vision-language models

P Xu, W Shao, K Zhang, P Gao, S Liu… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Large Vision-Language Models (LVLMs) have recently played a dominant role in
multimodal vision-language learning. Despite the great success, it lacks a holistic evaluation …

Real-time scene text detection with differentiable binarization and adaptive scale fusion

M Liao, Z Zou, Z Wan, C Yao… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …

Fourier contour embedding for arbitrary-shaped text detection

Y Zhu, J Chen, L Liang, Z Kuang… - Proceedings of the …, 2021 - openaccess.thecvf.com
One of the main challenges for arbitrary-shaped text detection is to design a good text
instance representation that allows networks to learn diverse text geometry variances. Most …

Real-time scene text detection with differentiable binarization

M Liao, Z Wan, C Yao, K Chen, X Bai - Proceedings of the AAAI …, 2020 - ojs.aaai.org
Recently, segmentation-based methods are quite popular in scene text detection, as the
segmentation results can more accurately describe scene text of various shapes such as …

Deepsolo: Let transformer decoder with explicit points solo for text spotting

M Ye, J Zhang, S Zhao, J Liu, T Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
End-to-end text spotting aims to integrate scene text detection and recognition into a unified
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …

On the hidden mystery of ocr in large multimodal models

Y Liu, Z Li, B Yang, C Li, X Yin, C Liu, L Jin… - arXiv preprint arXiv …, 2023 - arxiv.org
Large models have recently played a dominant role in natural language processing and
multimodal vision-language learning. However, their effectiveness in text-related visual …

Swintextspotter: Scene text spotting via better synergy between text detection and text recognition

M Huang, Y Liu, Z Peng, C Liu, D Lin… - proceedings of the …, 2022 - openaccess.thecvf.com
End-to-end scene text spotting has attracted great attention in recent years due to the
success of excavating the intrinsic synergy of the scene text detection and recognition …