Freeseg: Unified, universal and open-vocabulary image segmentation

J Qin, J Wu, P Yan, M Li, R Yuxi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, open-vocabulary learning has emerged to accomplish segmentation for arbitrary
categories of text-based descriptions, which popularizes the segmentation system to more …

A comprehensive survey on image captioning: from handcrafted to deep learning-based techniques, a taxonomy and open research issues

H Sharma, D Padha - Artificial Intelligence Review, 2023 - Springer
Image captioning is a pretty modern area of the convergence of computer vision and natural
language processing and is widely used in a range of applications such as multi-modal …

Aligndet: Aligning pre-training and fine-tuning in object detection

M Li, J Wu, X Wang, C Chen, J Qin… - Proceedings of the …, 2023 - openaccess.thecvf.com
The paradigm of large-scale pre-training followed by downstream fine-tuning has been
widely employed in various object detection algorithms. In this paper, we reveal …

Automated radiographic report generation purely on transformer: A multicriteria supervised approach

Z Wang, H Han, L Wang, X Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Automated radiographic report generation is challenging in at least two aspects. First,
medical images are very similar to each other and the visual differences of clinic importance …

A thorough review of models, evaluation metrics, and datasets on image captioning

G Luo, L Cheng, C Jing, C Zhao… - IET Image Processing, 2022 - Wiley Online Library
Image captioning means generate descriptive sentences from a query image automatically.
It has recently received widespread attention from the computer vision and natural language …

Knowing what to learn: a metric-oriented focal mechanism for image captioning

J Ji, Y Ma, X Sun, Y Zhou, Y Wu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Despite considerable progress, image captioning still suffers from the huge difference in
quality between easy and hard examples, which is left unexploited in existing methods. To …

Fuzzy embedded clustering based on bipartite graph for large-scale hyperspectral image

X Yang, Y Xu, S Li, Y Liu, Y Liu - IEEE Geoscience and Remote …, 2021 - ieeexplore.ieee.org
Hyperspectral image (HSI) clustering has been widely used in the field of remote sensing.
However, most traditional clustering algorithms are not suitable for dealing with large-scale …

Image caption generation using visual attention prediction and contextual spatial relation extraction

R Sasibhooshan, S Kumaraswamy, S Sasidharan - Journal of Big Data, 2023 - Springer
Automatic caption generation with attention mechanisms aims at generating more
descriptive captions containing coarser to finer semantic contents in the image. In this work …

Learning joint relationship attention network for image captioning

C Wang, X Gu - Expert Systems with Applications, 2023 - Elsevier
Image captioning aims at automatically describing the main content of an image with a
complete and natural sentence. Existing attention-based methods often focus on visual …

Transformer-based local-global guidance for image captioning

H Parvin, AR Naghsh-Nilchi, HM Mohammadi - Expert Systems with …, 2023 - Elsevier
Image captioning is a difficult problem for machine learning algorithms to compress huge
amounts of images into descriptive languages. The recurrent models are popularly used as …