Prompt distribution learning

Y Lu, J Liu, Y Zhang, Y Liu… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We present prompt distribution learning for effectively adapting a pre-trained vision-language
model to address downstream recognition tasks. Our method not only learns low …

Exposure normalization and compensation for multiple-exposure correction

J Huang, Y Liu, X Fu, M Zhou, Y Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Images captured with improper exposures usually bring unsatisfactory visual effects.
Previous works mainly focus on either underexposure or overexposure correction, resulting …

Association graph learning for multi-task classification with category shifts

J Shen, Z Xiao, X Zhen, C Snoek… - Advances in Neural …, 2022 - proceedings.neurips.cc
In this paper, we focus on multi-task classification, where related classification tasks share
the same label space and are learned simultaneously. In particular, we tackle a new setting …

MultiMatch: Multi-task learning for semi-supervised domain generalization

L Qi, H Yang, Y Shi, X Geng - ACM Transactions on Multimedia …, 2024 - dl.acm.org
Domain generalization (DG) aims at learning a model on source domains to well generalize
on the unseen target domain. Although it has achieved great success, most of the existing …

Using semantic information for defining and detecting OOD inputs

R Kaur, X Ji, S Dutta, M Caprio, Y Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
As machine learning models continue to achieve impressive performance across different
tasks, the importance of effective anomaly detection for such models has increased as well …

DoubleAUG: Single-domain Generalized Object Detector in Urban via Color Perturbation and Dual-style Memory

L Qi, P Dong, T Xiong, H Xue, X Geng - ACM Transactions on Multimedia …, 2024 - dl.acm.org
Object detection in urban scenarios is crucial for autonomous driving in intelligent traffic
systems. However, unlike conventional object detection tasks, urban-scene images vary …

Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection

M Jia, L Zhao, G Li, Y Zheng - arXiv preprint arXiv:2412.08506, 2024 - arxiv.org
Human-object interaction (HOI) detectors with popular query-transformer architecture have
achieved promising performance. However, accurately identifying uncommon visual …

Unbiased Semantic Representation Learning Based on Causal Disentanglement for Domain Generalization

X Jin, N Li, W Kong, J Tang, B Yang - ACM Transactions on Multimedia …, 2024 - dl.acm.org
Domain generalization primarily mitigates domain shift among multiple source domains,
generalizing the trained model to an unseen target domain. However, the spurious …

Test-Time Distribution Learning Adapter for Cross-Modal Visual Reasoning

Y Zhang, C Zhang - ICASSP 2024-2024 IEEE International …, 2024 - ieeexplore.ieee.org
Vision-Language Pre-Trained (VLP) models, such as CLIP, have demonstrated remarkable
effectiveness in learning generic visual representations. Several approaches aim to …