Text recognition in the wild: A survey
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …
information carried by text is important in a wide range of vision-based application …
Breast tumor localization and segmentation using machine learning techniques: Overview of datasets, findings, and methods
Abstract The Global Cancer Statistics 2020 reported breast cancer (BC) as the most
common diagnosis of cancer type. Therefore, early detection of such type of cancer would …
common diagnosis of cancer type. Therefore, early detection of such type of cancer would …
A survey on multimodal large language models
Multimodal Large Language Model (MLLM) recently has been a new rising research
hotspot, which uses powerful Large Language Models (LLMs) as a brain to perform …
hotspot, which uses powerful Large Language Models (LLMs) as a brain to perform …
Lvlm-ehub: A comprehensive evaluation benchmark for large vision-language models
Large Vision-Language Models (LVLMs) have recently played a dominant role in
multimodal vision-language learning. Despite the great success, it lacks a holistic evaluation …
multimodal vision-language learning. Despite the great success, it lacks a holistic evaluation …
Real-time scene text detection with differentiable binarization and adaptive scale fusion
Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …
in the scene text detection field, because of their superiority in detecting the text instances of …
Fourier contour embedding for arbitrary-shaped text detection
One of the main challenges for arbitrary-shaped text detection is to design a good text
instance representation that allows networks to learn diverse text geometry variances. Most …
instance representation that allows networks to learn diverse text geometry variances. Most …
Real-time scene text detection with differentiable binarization
Recently, segmentation-based methods are quite popular in scene text detection, as the
segmentation results can more accurately describe scene text of various shapes such as …
segmentation results can more accurately describe scene text of various shapes such as …
Deepsolo: Let transformer decoder with explicit points solo for text spotting
End-to-end text spotting aims to integrate scene text detection and recognition into a unified
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …
framework. Dealing with the relationship between the two sub-tasks plays a pivotal role in …
On the hidden mystery of ocr in large multimodal models
Large models have recently played a dominant role in natural language processing and
multimodal vision-language learning. However, their effectiveness in text-related visual …
multimodal vision-language learning. However, their effectiveness in text-related visual …
Swintextspotter: Scene text spotting via better synergy between text detection and text recognition
End-to-end scene text spotting has attracted great attention in recent years due to the
success of excavating the intrinsic synergy of the scene text detection and recognition …
success of excavating the intrinsic synergy of the scene text detection and recognition …