SST: Spatial and semantic transformers for multi-label image recognition
Multi-label image recognition has attracted considerable research attention and achieved
great success in recent years. Capturing label correlations is an effective manner to advance …
great success in recent years. Capturing label correlations is an effective manner to advance …
Integrating multi-label contrastive learning with dual adversarial graph neural networks for cross-modal retrieval
With the growing amount of multimodal data, cross-modal retrieval has attracted more and
more attention and become a hot research topic. To date, most of the existing techniques …
more attention and become a hot research topic. To date, most of the existing techniques …
Toward purifying defect feature for multilabel sewer defect classification
An automatic vision-based sewer inspection plays a key role of sewage system in a modern
city. Recent advances focus on utilizing a deep learning model to realize the sewer …
city. Recent advances focus on utilizing a deep learning model to realize the sewer …
Dynamic correlation learning and regularization for multi-label confidence calibration
Modern visual recognition models often display overconfidence due to their reliance on
complex deep neural networks and one-hot target supervision, resulting in unreliable …
complex deep neural networks and one-hot target supervision, resulting in unreliable …
Global-local label correlation for partial multi-label learning
Partial Multi-label Learning (PML) addresses the scenario where each instance is assigned
with multiple candidate labels, while only a subset of the labels are relevant. This task is very …
with multiple candidate labels, while only a subset of the labels are relevant. This task is very …
A dual modality approach for (zero-shot) multi-label classification
In computer vision, multi-label classification, including zero-shot multi-label classification are
important tasks with many real-world applications. In this paper, we propose a novel …
important tasks with many real-world applications. In this paper, we propose a novel …
Contrastively enforcing distinctiveness for multi-label image classification
Recently, as an effective way of learning latent representations, contrastive learning has
been increasingly popular and successful in various domains. The success of contrastive …
been increasingly popular and successful in various domains. The success of contrastive …
Beyond word embeddings: Heterogeneous prior knowledge driven multi-label image classification
Multi-Label Image Classification (MLIC) is a fundamental yet challenging task which aims to
recognize multiple labels from given images. The key to solve MLIC lies in how to accurately …
recognize multiple labels from given images. The key to solve MLIC lies in how to accurately …
Learning label semantics for weakly supervised group activity recognition
Weakly supervised group activity recognition deals with the dependence on individual-level
annotations during understanding scenes involving multiple individuals, which is a …
annotations during understanding scenes involving multiple individuals, which is a …
Cross-modal cognitive consensus guided audio-visual segmentation
Audio-Visual Segmentation (AVS) aims to extract the sounding object from a video frame,
which is represented by a pixel-wise segmentation mask for application scenarios such as …
which is represented by a pixel-wise segmentation mask for application scenarios such as …