Repvit: Revisiting mobile cnn from vit perspective

A Wang, H Chen, Z Lin, J Han… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Abstract Recently lightweight Vision Transformers (ViTs) demonstrate superior performance
and lower latency compared with lightweight Convolutional Neural Networks (CNNs) on …

Pyra: Parallel yielding re-activation for training-inference efficient task adaptation

Y Xiong, H Chen, T Hao, Z Lin, J Han, Y Zhang… - … on Computer Vision, 2025 - Springer
Recently, the scale of transformers has grown rapidly, which introduces considerable
challenges in terms of training overhead and inference efficiency in the scope of task …

A comprehensive survey on deep active learning in medical image analysis

H Wang, Q Jin, S Li, S Liu, M Wang, Z Song - Medical Image Analysis, 2024 - Elsevier
Deep learning has achieved widespread success in medical image analysis, leading to an
increasing demand for large-scale expert-annotated medical image datasets. Yet, the high …

A comprehensive survey on deep active learning and its applications in medical image analysis

H Wang, Q Jin, S Li, S Liu, M Wang, Z Song - arXiv preprint arXiv …, 2023 - arxiv.org
Deep learning has achieved widespread success in medical image analysis, leading to an
increasing demand for large-scale expert-annotated medical image datasets. Yet, the high …

Quantized prompt for efficient generalization of vision-language models

T Hao, X Ding, J Feng, Y Yang, H Chen… - European Conference on …, 2025 - Springer
In the past few years, large-scale pre-trained vision-language models like CLIP have
achieved tremendous success in various fields. Naturally, how to transfer the rich knowledge …

AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving

M Liang, JC Su, S Schulter, S Garg… - Proceedings of the …, 2024 - openaccess.thecvf.com
Autonomous vehicle (AV) systems rely on robust perception models as a cornerstone of
safety assurance. However objects encountered on the road exhibit a long-tailed distribution …

Exploring diversity-based active learning for 3d object detection in autonomous driving

J Lin, Z Liang, S Deng, L Cai, T Jiang… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
3D object detection has recently received much attention due to its great potential in
autonomous vehicle (AV). The success of deep learning based object detectors relies on the …

Plug and play active learning for object detection

C Yang, L Huang, EJ Crowley - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Annotating datasets for object detection is an expensive and time-consuming endeavor. To
minimize this burden active learning (AL) techniques are employed to select the most …

Llmi3d: Empowering llm with 3d perception from a single 2d image

F Yang, S Zhao, Y Zhang, H Chen, H Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in autonomous driving, augmented reality, robotics, and embodied
intelligence have necessitated 3D perception algorithms. However, current 3D perception …

Learn from the learnt: source-free active domain adaptation via contrastive sampling and visual persistence

M Lyu, T Hao, X Xu, H Chen, Z Lin, J Han… - European Conference on …, 2025 - Springer
Abstract Domain Adaptation (DA) facilitates knowledge transfer from a source domain to a
related target domain. This paper investigates a practical DA paradigm, namely Source data …