Object detection using deep learning, CNNs and vision transformers: A review
Detecting objects remains one of computer vision and image understanding applications'
most fundamental and challenging aspects. Significant advances in object detection have …
most fundamental and challenging aspects. Significant advances in object detection have …
[HTML][HTML] 2D and 3D object detection algorithms from images: A Survey
W Chen, Y Li, Z Tian, F Zhang - Array, 2023 - Elsevier
Object detection is a crucial branch of computer vision that aims to locate and classify
objects in images. Using deep convolutional neural networks (CNNs) as the primary …
objects in images. Using deep convolutional neural networks (CNNs) as the primary …
Grounding dino: Marrying dino with grounded pre-training for open-set object detection
In this paper, we present an open-set object detector, called Grounding DINO, by marrying
Transformer-based detector DINO with grounded pre-training, which can detect arbitrary …
Transformer-based detector DINO with grounded pre-training, which can detect arbitrary …
Diffusiondet: Diffusion model for object detection
We propose DiffusionDet, a new framework that formulates object detection as a denoising
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …
Detrs beat yolos on real-time object detection
Y Zhao, W Lv, S Xu, J Wei, G Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
The YOLO series has become the most popular framework for real-time object detection due
to its reasonable trade-off between speed and accuracy. However we observe that the …
to its reasonable trade-off between speed and accuracy. However we observe that the …
Transfusion: Robust lidar-camera fusion for 3d object detection with transformers
LiDAR and camera are two important sensors for 3D object detection in autonomous driving.
Despite the increasing popularity of sensor fusion in this field, the robustness against inferior …
Despite the increasing popularity of sensor fusion in this field, the robustness against inferior …
Dino: Detr with improved denoising anchor boxes for end-to-end object detection
We present DINO (\textbf {D} ETR with\textbf {I} mproved de\textbf {N} oising anch\textbf {O} r
boxes), a state-of-the-art end-to-end object detector.% in this paper. DINO improves over …
boxes), a state-of-the-art end-to-end object detector.% in this paper. DINO improves over …
Petr: Position embedding transformation for multi-view 3d object detection
In this paper, we develop position embedding transformation (PETR) for multi-view 3D
object detection. PETR encodes the position information of 3D coordinates into image …
object detection. PETR encodes the position information of 3D coordinates into image …
Dn-detr: Accelerate detr training by introducing query denoising
We present in this paper a novel denoising training method to speedup DETR (DEtection
TRansformer) training and offer a deepened understanding of the slow convergence issue …
TRansformer) training and offer a deepened understanding of the slow convergence issue …
Masked-attention mask transformer for universal image segmentation
Image segmentation groups pixels with different semantics, eg, category or instance
membership. Each choice of semantics defines a task. While only the semantics of each task …
membership. Each choice of semantics defines a task. While only the semantics of each task …