A comprehensive survey on pretrained foundation models: A history from bert to chatgpt

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks with different data modalities. A PFM (eg, BERT, ChatGPT, and GPT-4) is …

Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

Exploring plain vision transformer backbones for object detection

Y Li, H Mao, R Girshick, K He - European conference on computer vision, 2022 - Springer
We explore the plain, non-hierarchical Vision Transformer (ViT) as a backbone network for
object detection. This design enables the original ViT architecture to be fine-tuned for object …

YOLOv5-Tassel: Detecting tassels in RGB UAV imagery with improved YOLOv5 based on transfer learning

W Liu, K Quijano, MM Crawford - IEEE Journal of Selected …, 2022 - ieeexplore.ieee.org
Unmanned aerial vehicles (UAVs) equipped with lightweight sensors, such as RGB cameras
and LiDAR, have significant potential in precision agriculture, including object detection …

Anchor detr: Query design for transformer-based detector

Y Wang, X Zhang, T Yang, J Sun - … of the AAAI conference on artificial …, 2022 - ojs.aaai.org
In this paper, we propose a novel query design for the transformer-based object detection. In
previous transformer-based detectors, the object queries are a set of learned embeddings …

YOLOWeeds: A novel benchmark of YOLO object detectors for multi-class weed detection in cotton production systems

F Dang, D Chen, Y Lu, Z Li - Computers and Electronics in Agriculture, 2023 - Elsevier
Weeds are among the major threats to cotton production. Overreliance on herbicides for
weed control has accelerated the evolution of herbicide-resistance in weeds and caused …

Rethinking efficient lane detection via curve modeling

Z Feng, S Guo, X Tan, K Xu… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper presents a novel parametric curve-based method for lane detection in RGB
images. Unlike state-of-the-art segmentation-based and point detection-based methods that …

RFLA: Gaussian receptive field based label assignment for tiny object detection

C Xu, J Wang, W Yang, H Yu, L Yu, GS Xia - European conference on …, 2022 - Springer
Detecting tiny objects is one of the main obstacles hindering the development of object
detection. The performance of generic object detectors tends to drastically deteriorate on tiny …

PillarNeXt: Rethinking network designs for 3D object detection in LiDAR point clouds

J Li, C Luo, X Yang - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
In order to deal with the sparse and unstructured raw point clouds, most LiDAR based 3D
object detection research focuses on designing dedicated local point aggregators for fine …

Sparse instance activation for real-time instance segmentation

T Cheng, X Wang, S Chen, W Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we propose a conceptually novel, efficient, and fully convolutional framework
for real-time instance segmentation. Previously, most instance segmentation methods …