Deep learning for weakly-supervised object detection and localization: A survey

F Shao, L Chen, J Shao, W Ji, S Xiao, L Ye, Y Zhuang… - Neurocomputing, 2022 - Elsevier
Abstract Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), ie.,
detecting multiple and single instances with bounding boxes in an image using image-level …

Fairness-aware contrastive learning with partially annotated sensitive attributes

F Zhang, K Kuang, L Chen, Y Liu, C Wu… - … Conference on Learning …, 2022 - openreview.net
Learning high-quality representation is important and essential for visual recognition.
Unfortunately, traditional representation learning suffers from fairness issues since the …

Active learning for point cloud semantic segmentation via spatial-structural diversity reasoning

F Shao, Y Luo, P Liu, J Chen, Y Yang, Y Lu… - Proceedings of the 30th …, 2022 - dl.acm.org
The expensive annotation cost is notoriously known as the main constraint for the
development of the point cloud semantic segmentation technique. Active learning methods …

Causal interventional training for image recognition

W Qin, H Zhang, R Hong, EP Lim… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Deep learning models often fit undesired dataset bias in training. In this paper, we formulate
the bias using causal inference, which helps us uncover the ever-elusive causalities among …

Nicest: Noisy label correction and training for robust scene graph generation

L Li, J Xiao, H Shi, H Zhang, Y Yang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Nearly all existing scene graph generation (SGG) models have overlooked the ground-truth
annotation qualities of mainstream SGG datasets, ie, they assume: 1) all the manually …

Weakly-supervised video object grounding via causal intervention

W Wang, J Gao, C Xu - IEEE Transactions on Pattern Analysis …, 2022 - ieeexplore.ieee.org
We target at the task of weakly-supervised video object grounding (WSVOG), where only
video-sentence annotations are available during model learning. It aims to localize objects …

SPA2Net: Structure-Preserved Attention Activated Network for Weakly Supervised Object Localization

D Chen, X Pan, F Tang, W Dong… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
By exploring the localizable representations in deep CNN, weakly supervised object
localization (WSOL) methods could determine the position of the object in each image just …

Fdcnet: Feature drift compensation network for class-incremental weakly supervised object localization

S Park, T Lee, Y Lee, B Kang - … of the 31st ACM International Conference …, 2023 - dl.acm.org
This work addresses the task of class-incremental weakly supervised object localization (CI-
WSOL). The goal is to incrementally learn object localization for novel classes using only …

Improving weakly supervised sound event detection with causal intervention

Y Xin, D Yang, F Cui, Y Wang… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Existing weakly supervised sound event detection (WSSED) work has not explored both
types of co-occurrences simultaneously, ie, some sound events often co-occur, and their …

Semantic-constraint matching transformer for weakly supervised object localization

Y Cao, Y Su, W Wang, Y Liu, Q Wu - arXiv preprint arXiv:2309.01331, 2023 - arxiv.org
Weakly supervised object localization (WSOL) strives to learn to localize objects with only
image-level supervision. Due to the local receptive fields generated by convolution …