End-to-end video text detection with online tracking
Text in videos usually acts as important semantic cues, which is helpful to video analysis.
Video text detection is considered as one of the most difficult tasks in document analysis due …
Video text detection is considered as one of the most difficult tasks in document analysis due …
Semantic-aware video text detection
Most existing video text detection methods track texts with appearance features, which are
easily influenced by the change of perspective and illumination. Compared with appearance …
easily influenced by the change of perspective and illumination. Compared with appearance …
End-to-end video text spotting with transformer
Recent video text spotting methods usually require the three-staged pipeline, ie, detecting
text in individual images, recognizing localized text, tracking text streams with post …
text in individual images, recognizing localized text, tracking text streams with post …
Free: A fast and robust end-to-end video text spotter
Currently, video text spotting tasks usually fall into the four-staged pipeline: detecting text
regions in individual images, recognizing localized text regions frame-wisely, tracking text …
regions in individual images, recognizing localized text regions frame-wisely, tracking text …
Video text tracking with a spatio-temporal complementary model
Y Gao, X Li, J Zhang, Y Zhou, D Jin… - … on Image Processing, 2021 - ieeexplore.ieee.org
Text tracking is to track multiple texts in a video, and construct a trajectory for each text.
Existing methods tackle this task by utilizing the tracking-by-detection framework, ie …
Existing methods tackle this task by utilizing the tracking-by-detection framework, ie …
You only recognize once: Towards fast video text spotting
Video text spotting is still an important research topic due to its various real-applications.
Previous approaches usually fall into the four-staged pipeline: text detection in individual …
Previous approaches usually fall into the four-staged pipeline: text detection in individual …
Real-time end-to-end video text spotter with contrastive representation learning
Video text spotting (VTS) is the task that requires simultaneously detecting, tracking and
recognizing text in the video. Existing video text spotting methods typically develop …
recognizing text in the video. Existing video text spotting methods typically develop …
Scale-residual learning network for scene text detection
Detecting incidentally captured text in the wild remains an open problem due to challenging
factors including unconstrained scenarios and large scale variation. In this paper, we …
factors including unconstrained scenarios and large scale variation. In this paper, we …
A new deep wavefront based model for text localization in 3D video
L Nandanwar, P Shivakumara… - … on Circuits and …, 2021 - ieeexplore.ieee.org
With the evolution of electronic devices, such as 3D cameras, addressing the challenges of
text localization in 3D video (eg, for indexing) is increasingly drawing the attention of the …
text localization in 3D video (eg, for indexing) is increasingly drawing the attention of the …
Towards accurate video text spotting with text-wise semantic reasoning
Video text spotting (VTS) aims at extracting texts from videos, where text detection, tracking
and recognition are conducted simultaneously. There have been some works that can tackle …
and recognition are conducted simultaneously. There have been some works that can tackle …