Fine-grained image captioning with global-local discriminative objective

J Qin, J Wu, P Yan, M Li, R Yuxi… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recently, open-vocabulary learning has emerged to accomplish segmentation for arbitrary
categories of text-based descriptions, which popularizes the segmentation system to more …

Cited by 64 Related articles All 5 versions

A comprehensive survey on image captioning: from handcrafted to deep learning-based techniques, a taxonomy and open research issues

H Sharma, D Padha - Artificial Intelligence Review, 2023 - Springer

Image captioning is a pretty modern area of the convergence of computer vision and natural
language processing and is widely used in a range of applications such as multi-modal …

Cited by 12 Related articles All 3 versions

AlignDet: Aligning pre-training and fine-tuning in object detection

M Li, J Wu, X Wang, C Chen, J Qin… - Proceedings of the …, 2023 - openaccess.thecvf.com

The paradigm of large-scale pre-training followed by downstream fine-tuning has been
widely employed in various object detection algorithms. In this paper, we reveal …

Cited by 11 Related articles All 5 versions

Automated radiographic report generation purely on transformer: A multicriteria supervised approach

Z Wang, H Han, L Wang, X Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Automated radiographic report generation is challenging in at least two aspects. First,
medical images are very similar to each other and the visual differences of clinic importance …

Cited by 47 Related articles All 4 versions

A thorough review of models, evaluation metrics, and datasets on image captioning

G Luo, L Cheng, C Jing, C Zhao… - IET Image Processing, 2022 - Wiley Online Library

Image captioning means generate descriptive sentences from a query image automatically.
It has recently received widespread attention from the computer vision and natural language …

Cited by 16 Related articles All 4 versions

Knowing what to learn: a metric-oriented focal mechanism for image captioning

J Ji, Y Ma, X Sun, Y Zhou, Y Wu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Despite considerable progress, image captioning still suffers from the huge difference in
quality between easy and hard examples, which is left unexploited in existing methods. To …

Cited by 31 Related articles All 5 versions

Fuzzy embedded clustering based on bipartite graph for large-scale hyperspectral image

X Yang, Y Xu, S Li, Y Liu, Y Liu - IEEE Geoscience and Remote …, 2021 - ieeexplore.ieee.org

Hyperspectral image (HSI) clustering has been widely used in the field of remote sensing.
However, most traditional clustering algorithms are not suitable for dealing with large-scale …

Cited by 59 Related articles All 4 versions

Image caption generation using visual attention prediction and contextual spatial relation extraction

R Sasibhooshan, S Kumaraswamy, S Sasidharan - Journal of Big Data, 2023 - Springer

Automatic caption generation with attention mechanisms aims at generating more
descriptive captions containing coarser to finer semantic contents in the image. In this work …

Cited by 18 Related articles All 7 versions

Learning joint relationship attention network for image captioning

C Wang, X Gu - Expert Systems with Applications, 2023 - Elsevier

Image captioning aims at automatically describing the main content of an image with a
complete and natural sentence. Existing attention-based methods often focus on visual …

Cited by 19 Related articles All 2 versions

Transformer-based local-global guidance for image captioning

H Parvin, AR Naghsh-Nilchi, HM Mohammadi - Expert Systems with …, 2023 - Elsevier

Image captioning is a difficult problem for machine learning algorithms to compress huge
amounts of images into descriptive languages. The recurrent models are popularly used as …

Cited by 12 Related articles All 2 versions