Prompt distribution learning
We present prompt distribution learning for effectively adapting a pre-trained vision-
language model to address downstream recognition tasks. Our method not only learns low …
language model to address downstream recognition tasks. Our method not only learns low …
Exposure normalization and compensation for multiple-exposure correction
Images captured with improper exposures usually bring unsatisfactory visual effects.
Previous works mainly focus on either underexposure or overexposure correction, resulting …
Previous works mainly focus on either underexposure or overexposure correction, resulting …
Association graph learning for multi-task classification with category shifts
In this paper, we focus on multi-task classification, where related classification tasks share
the same label space and are learned simultaneously. In particular, we tackle a new setting …
the same label space and are learned simultaneously. In particular, we tackle a new setting …
Multimatch: Multi-task learning for semi-supervised domain generalization
Domain generalization (DG) aims at learning a model on source domains to well generalize
on the unseen target domain. Although it has achieved great success, most of the existing …
on the unseen target domain. Although it has achieved great success, most of the existing …
Using semantic information for defining and detecting ood inputs
As machine learning models continue to achieve impressive performance across different
tasks, the importance of effective anomaly detection for such models has increased as well …
tasks, the importance of effective anomaly detection for such models has increased as well …
DoubleAUG: Single-domain Generalized Object Detector in Urban via Color Perturbation and Dual-style Memory
Object detection in urban scenarios is crucial for autonomous driving in intelligent traffic
systems. However, unlike conventional object detection tasks, urban-scene images vary …
systems. However, unlike conventional object detection tasks, urban-scene images vary …
Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection
Human-object interaction (HOI) detectors with popular query-transformer architecture have
achieved promising performance. However, accurately identifying uncommon visual …
achieved promising performance. However, accurately identifying uncommon visual …
Unbiased Semantic Representation Learning Based on Causal Disentanglement for Domain Generalization
Domain generalization primarily mitigates domain shift among multiple source domains,
generalizing the trained model to an unseen target domain. However, the spurious …
generalizing the trained model to an unseen target domain. However, the spurious …
Test-Time Distribution Learning Adapter for Cross-Modal Visual Reasoning
Vision-Language Pre-Trained (VLP) models, such as CLIP, have demonstrated remarkable
effectiveness in learning generic visual representations. Several approaches aim to …
effectiveness in learning generic visual representations. Several approaches aim to …