[PDF][PDF] YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems

CY Wang, HYM Liao - APSIPA Transactions on Signal and …, 2024 - nowpublishers.com
This is a comprehensive review of the YOLO series of systems. Different from previous
literature surveys, this review article reexamines the characteristics of the YOLO series from …

Foundation models in robotics: Applications, challenges, and the future

R Firoozi, J Tucker, S Tian… - … Journal of Robotics …, 2023 - journals.sagepub.com
We survey applications of pretrained foundation models in robotics. Traditional deep
learning models in robotics are trained on small datasets tailored for specific tasks, which …

Open-vocabulary sam: Segment and recognize twenty-thousand classes interactively

H Yuan, X Li, C Zhou, Y Li, K Chen, CC Loy - European Conference on …, 2025 - Springer
Abstract The CLIP and Segment Anything Model (SAM) are remarkable vision foundation
models (VFMs). SAM excels in segmentation tasks across diverse domains, whereas CLIP is …

Cnos: A strong baseline for cad-based novel object segmentation

VN Nguyen, T Groueix, G Ponimatkin… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a simple yet powerful method to segment novel objects in RGB images from
their CAD models. Leveraging recent foundation models, Segment Anything and DINOv2 …

IRSAM: Advancing segment anything model for infrared small target detection

M Zhang, Y Wang, J Guo, Y Li, X Gao… - European Conference on …, 2025 - Springer
Abstract The recent Segment Anything Model (SAM) is a significant advancement in natural
image segmentation, exhibiting potent zero-shot performance suitable for various …

Open-vocabulary semantic segmentation with decoupled one-pass network

C Han, Y Zhong, D Li, K Han… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recently, the open-vocabulary semantic segmentation problem has attracted increasing
attention and the best performing methods are based on two-stream networks: one stream …

Advances and challenges in deep learning-based change detection for remote sensing images: A review through various learning paradigms

L Wang, M Zhang, X Gao, W Shi - Remote Sensing, 2024 - mdpi.com
Change detection (CD) in remote sensing (RS) imagery is a pivotal method for detecting
changes in the Earth's surface, finding wide applications in urban planning, disaster …

Sam-assisted remote sensing imagery semantic segmentation with object and boundary constraints

X Ma, Q Wu, X Zhao, X Zhang, MO Pun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Semantic segmentation of remote sensing imagery plays a pivotal role in extracting precise
information for diverse downstream applications. Recent development of the segment …

Adapting segment anything model for change detection in VHR remote sensing images

L Ding, K Zhu, D Peng, H Tang, K Yang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Vision foundation models (VFMs), such as the segment anything model (SAM), allow zero-
shot or interactive segmentation of visual contents; thus, they are quickly applied in a variety …

Drive anywhere: Generalizable end-to-end autonomous driving with multi-modal foundation models

TH Wang, A Maalouf, W Xiao, Y Ban… - … on Robotics and …, 2024 - ieeexplore.ieee.org
As autonomous driving technology matures, end-to-end methodologies have emerged as a
leading strategy, promising seamless integration from perception to control via deep …