Review the state-of-the-art technologies of semantic segmentation based on deep learning

Y Mo, Y Wu, X Yang, F Liu, Y Liao - Neurocomputing, 2022 - Elsevier
The goal of semantic segmentation is to segment the input image according to semantic
information and predict the semantic category of each pixel from a given label set. With the …

Deep learning for 3d point clouds: A survey

Y Guo, H Wang, Q Hu, H Liu, L Liu… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Point cloud learning has lately attracted increasing attention due to its wide applications in
many areas, such as computer vision, autonomous driving, and robotics. As a dominating …

Segment and Recognize Anything at Any Granularity

F Li, H Zhang, P Sun, X Zou, S Liu, C Li, J Yang… - … on Computer Vision, 2025 - Springer
In this work, we introduce Semantic-SAM, an augmented image segmentation foundation for
segmenting and recognizing anything at desired granularities. Compared to the …

A survey of deep learning-based object detection

L Jiao, F Zhang, F Liu, S Yang, L Li, Z Feng… - IEEE access, 2019 - ieeexplore.ieee.org
Object detection is one of the most important and challenging branches of computer vision,
which has been widely applied in people's life, such as monitoring security, autonomous …

Argoverse: 3d tracking and forecasting with rich maps

MF Chang, J Lambert, P Sangkloy… - Proceedings of the …, 2019 - openaccess.thecvf.com
We present Argoverse, a dataset designed to support autonomous vehicle perception tasks
including 3D tracking and motion forecasting. Argoverse includes sensor data collected by a …

Paco: Parts and attributes of common objects

V Ramanathan, A Kalia, V Petrovic… - Proceedings of the …, 2023 - openaccess.thecvf.com
Object models are gradually progressing from predicting just category labels to providing
detailed descriptions of object instances. This motivates the need for large datasets which …

Autoshape: Real-time shape-aware monocular 3d object detection

Z Liu, D Zhou, F Lu, J Fang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Existing deep learning-based approaches for monocular 3D object detection in autonomous
driving often model the object as a rotated 3D cuboid while the object's geometric shape has …

The apolloscape dataset for autonomous driving

X Huang, X Cheng, Q Geng, B Cao… - Proceedings of the …, 2018 - openaccess.thecvf.com
Scene parsing aims to assign a class (semantic) label for each pixel in an image. It is a
comprehensive analysis of an image. Given the rise of autonomous driving, pixel-accurate …

Tools, techniques, datasets and application areas for object detection in an image: a review

J Kaur, W Singh - Multimedia Tools and Applications, 2022 - Springer
Object detection is one of the most fundamental and challenging tasks to locate objects in
images and videos. Over the past, it has gained much attention to do more research on …

Rope3d: The roadside perception dataset for autonomous driving and monocular 3d object detection task

X Ye, M Shu, H Li, Y Shi, Y Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Concurrent perception datasets for autonomous driving are mainly limited to frontal view
with sensors mounted on the vehicle. None of them is designed for the overlooked roadside …