Change detection methods for remote sensing in the last decade: A comprehensive review

G Cheng, Y Huang, X Li, S Lyu, Z Xu, H Zhao, Q Zhao… - Remote Sensing, 2024 - mdpi.com
Change detection is an essential and widely utilized task in remote sensing that aims to
detect and analyze changes occurring in the same geographical area over time, which has …

Review of lightweight deep convolutional neural networks

F Chen, S Li, J Han, F Ren, Z Yang - Archives of Computational Methods …, 2024 - Springer
Lightweight deep convolutional neural networks (LDCNNs) are vital components of mobile
intelligence, particularly in mobile vision. Although various heavy networks with increasingly …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Jotr: 3d joint contrastive learning with transformers for occluded human mesh recovery

J Li, Z Yang, X Wang, J Ma, C Zhou… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this study, we focus on the problem of 3D human mesh recovery from a single image
under obscured conditions. Most state-of-the-art methods aim to improve 2D alignment …

Betrayed by captions: Joint caption grounding and generation for open vocabulary instance segmentation

J Wu, X Li, H Ding, X Li, G Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work, we focus on open vocabulary instance segmentation to expand a segmentation
model to classify and segment instance-level novel categories. Previous approaches have …

Deepfake generation and detection: A benchmark and survey

G Pei, J Zhang, M Hu, Z Zhang, C Wang, Y Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Deepfake is a technology dedicated to creating highly realistic facial images and videos
under specific conditions, which has significant application potential in fields such as …

Clip-ad: A language-guided staged dual-path model for zero-shot anomaly detection

X Chen, J Zhang, G Tian, H He, W Zhang… - … Joint Conference on …, 2024 - Springer
This paper considers zero-shot Anomaly Detection (AD), performing AD without reference
images of the test objects. We propose a framework called CLIP-AD to leverage the zero …

A diffusion-based framework for multi-class anomaly detection

H He, J Zhang, H Chen, X Chen, Z Li, X Chen… - Proceedings of the …, 2024 - ojs.aaai.org
Reconstruction-based approaches have achieved remarkable outcomes in anomaly
detection. The exceptional image reconstruction capabilities of recently popular diffusion …

Towards robust referring image segmentation

J Wu, X Li, X Li, H Ding, Y Tong… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Referring Image Segmentation (RIS) is a fundamental vision-language task that outputs
object masks based on text descriptions. Many works have achieved considerable progress …

Edgesam: Prompt-in-the-loop distillation for on-device deployment of sam

C Zhou, X Li, CC Loy, B Dai - arXiv preprint arXiv:2312.06660, 2023 - arxiv.org
This paper presents EdgeSAM, an accelerated variant of the Segment Anything Model
(SAM), optimized for efficient execution on edge devices with minimal compromise in …