SST: Spatial and semantic transformers for multi-label image recognition

ZM Chen, Q Cui, B Zhao, R Song… - … on Image Processing, 2022 - ieeexplore.ieee.org
Multi-label image recognition has attracted considerable research attention and achieved
great success in recent years. Capturing label correlations is an effective manner to advance …

Integrating multi-label contrastive learning with dual adversarial graph neural networks for cross-modal retrieval

S Qian, D Xue, Q Fang, C Xu - IEEE Transactions on Pattern …, 2022 - ieeexplore.ieee.org
With the growing amount of multimodal data, cross-modal retrieval has attracted more and
more attention and become a hot research topic. To date, most of the existing techniques …

Toward purifying defect feature for multilabel sewer defect classification

C Hu, B Dong, H Shao, J Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
An automatic vision-based sewer inspection plays a key role of sewage system in a modern
city. Recent advances focus on utilizing a deep learning model to realize the sewer …

Dynamic correlation learning and regularization for multi-label confidence calibration

T Chen, W Wang, T Pu, J Qin, Z Yang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Modern visual recognition models often display overconfidence due to their reliance on
complex deep neural networks and one-hot target supervision, resulting in unreliable …

Global-local label correlation for partial multi-label learning

L Sun, S Feng, J Liu, G Lyu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Partial Multi-label Learning (PML) addresses the scenario where each instance is assigned
with multiple candidate labels, while only a subset of the labels are relevant. This task is very …

A dual modality approach for (zero-shot) multi-label classification

S Xu, Y Li, J Hsiao, C Ho, Z Qi - arXiv preprint arXiv:2208.09562, 2022 - arxiv.org
In computer vision, multi-label classification, including zero-shot multi-label classification are
important tasks with many real-world applications. In this paper, we propose a novel …

Contrastively enforcing distinctiveness for multi-label image classification

SD Dao, H Zhao, D Phung, J Cai - Neurocomputing, 2023 - Elsevier
Recently, as an effective way of learning latent representations, contrastive learning has
been increasingly popular and successful in various domains. The success of contrastive …

Beyond word embeddings: Heterogeneous prior knowledge driven multi-label image classification

X Deng, S Feng, G Lyu, T Wang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Multi-Label Image Classification (MLIC) is a fundamental yet challenging task which aims to
recognize multiple labels from given images. The key to solve MLIC lies in how to accurately …

Learning label semantics for weakly supervised group activity recognition

L Wu, M Tian, Y Xiang, K Gu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Weakly supervised group activity recognition deals with the dependence on individual-level
annotations during understanding scenes involving multiple individuals, which is a …

Cross-modal cognitive consensus guided audio-visual segmentation

Z Shi, Q Wu, F Meng, L Xu, H Li - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Audio-Visual Segmentation (AVS) aims to extract the sounding object from a video frame,
which is represented by a pixel-wise segmentation mask for application scenarios such as …