Computer vision for autonomous vehicles: Problems, datasets and state of the art
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …
Deep multimodal fusion for semantic image segmentation: A survey
Recent advances in deep learning have shown excellent performance in various scene
understanding tasks. However, in some complex environments or under challenging …
understanding tasks. However, in some complex environments or under challenging …
Segnext: Rethinking convolutional attention design for semantic segmentation
We present SegNeXt, a simple convolutional network architecture for semantic
segmentation. Recent transformer-based models have dominated the field of se-mantic …
segmentation. Recent transformer-based models have dominated the field of se-mantic …
Transformer-based visual segmentation: A survey
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …
segments or groups. This technique has numerous real-world applications, such as …
Segvit: Semantic segmentation with plain vision transformers
We explore the capability of plain Vision Transformers (ViTs) for semantic segmentation and
propose the SegViT. Previous ViT-based segmentation networks usually learn a pixel-level …
propose the SegViT. Previous ViT-based segmentation networks usually learn a pixel-level …
Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers
Most recent semantic segmentation methods adopt a fully-convolutional network (FCN) with
an encoder-decoder architecture. The encoder progressively reduces the spatial resolution …
an encoder-decoder architecture. The encoder progressively reduces the spatial resolution …
Towards open vocabulary learning: A survey
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …
advancements in various core tasks like segmentation, tracking, and detection. However …
Rpvnet: A deep and efficient range-point-voxel fusion network for lidar point cloud segmentation
Point clouds can be represented in many forms (views), typically, point-based sets, voxel-
based cells or range-based images (ie, panoramic view). The point-based view is …
based cells or range-based images (ie, panoramic view). The point-based view is …
Axial-deeplab: Stand-alone axial-attention for panoptic segmentation
Convolution exploits locality for efficiency at a cost of missing long range context. Self-
attention has been adopted to augment CNNs with non-local interactions. Recent works …
attention has been adopted to augment CNNs with non-local interactions. Recent works …
Panoptic-deeplab: A simple, strong, and fast baseline for bottom-up panoptic segmentation
In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …