Freeseg: Unified, universal and open-vocabulary image segmentation
Recently, open-vocabulary learning has emerged to accomplish segmentation for arbitrary
categories of text-based descriptions, which popularizes the segmentation system to more …
categories of text-based descriptions, which popularizes the segmentation system to more …
A comprehensive survey on image captioning: from handcrafted to deep learning-based techniques, a taxonomy and open research issues
Image captioning is a pretty modern area of the convergence of computer vision and natural
language processing and is widely used in a range of applications such as multi-modal …
language processing and is widely used in a range of applications such as multi-modal …
Aligndet: Aligning pre-training and fine-tuning in object detection
The paradigm of large-scale pre-training followed by downstream fine-tuning has been
widely employed in various object detection algorithms. In this paper, we reveal …
widely employed in various object detection algorithms. In this paper, we reveal …
Automated radiographic report generation purely on transformer: A multicriteria supervised approach
Automated radiographic report generation is challenging in at least two aspects. First,
medical images are very similar to each other and the visual differences of clinic importance …
medical images are very similar to each other and the visual differences of clinic importance …
A thorough review of models, evaluation metrics, and datasets on image captioning
G Luo, L Cheng, C Jing, C Zhao… - IET Image Processing, 2022 - Wiley Online Library
Image captioning means generate descriptive sentences from a query image automatically.
It has recently received widespread attention from the computer vision and natural language …
It has recently received widespread attention from the computer vision and natural language …
Knowing what to learn: a metric-oriented focal mechanism for image captioning
Despite considerable progress, image captioning still suffers from the huge difference in
quality between easy and hard examples, which is left unexploited in existing methods. To …
quality between easy and hard examples, which is left unexploited in existing methods. To …
Fuzzy embedded clustering based on bipartite graph for large-scale hyperspectral image
X Yang, Y Xu, S Li, Y Liu, Y Liu - IEEE Geoscience and Remote …, 2021 - ieeexplore.ieee.org
Hyperspectral image (HSI) clustering has been widely used in the field of remote sensing.
However, most traditional clustering algorithms are not suitable for dealing with large-scale …
However, most traditional clustering algorithms are not suitable for dealing with large-scale …
Image caption generation using visual attention prediction and contextual spatial relation extraction
R Sasibhooshan, S Kumaraswamy, S Sasidharan - Journal of Big Data, 2023 - Springer
Automatic caption generation with attention mechanisms aims at generating more
descriptive captions containing coarser to finer semantic contents in the image. In this work …
descriptive captions containing coarser to finer semantic contents in the image. In this work …
Learning joint relationship attention network for image captioning
C Wang, X Gu - Expert Systems with Applications, 2023 - Elsevier
Image captioning aims at automatically describing the main content of an image with a
complete and natural sentence. Existing attention-based methods often focus on visual …
complete and natural sentence. Existing attention-based methods often focus on visual …
Transformer-based local-global guidance for image captioning
Image captioning is a difficult problem for machine learning algorithms to compress huge
amounts of images into descriptive languages. The recurrent models are popularly used as …
amounts of images into descriptive languages. The recurrent models are popularly used as …