Review the state-of-the-art technologies of semantic segmentation based on deep learning
The goal of semantic segmentation is to segment the input image according to semantic
information and predict the semantic category of each pixel from a given label set. With the …
information and predict the semantic category of each pixel from a given label set. With the …
RGB-T image analysis technology and application: A survey
Abstract RGB-Thermal infrared (RGB-T) image analysis has been actively studied in recent
years. In the past decade, it has received wide attention and made a lot of important …
years. In the past decade, it has received wide attention and made a lot of important …
GMNet: Graded-feature multilabel-learning network for RGB-thermal urban scene semantic segmentation
Semantic segmentation is a fundamental task in computer vision, and it has various
applications in fields such as robotic sensing, video surveillance, and autonomous driving. A …
applications in fields such as robotic sensing, video surveillance, and autonomous driving. A …
CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers
Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …
Delivering arbitrary-modal semantic segmentation
Multimodal fusion can make semantic segmentation more robust. However, fusing an
arbitrary number of modalities remains underexplored. To delve into this problem, we create …
arbitrary number of modalities remains underexplored. To delve into this problem, we create …
RGB-T semantic segmentation with location, activation, and sharpening
Semantic segmentation is important for scene understanding. To address the scenes of
adverse illumination conditions of natural images, thermal infrared (TIR) images are …
adverse illumination conditions of natural images, thermal infrared (TIR) images are …
MTANet: Multitask-aware network with hierarchical multimodal fusion for RGB-T urban scene understanding
Understanding urban scenes is a fundamental ability requirement for assisted driving and
autonomous vehicles. Most of the available urban scene understanding methods use red …
autonomous vehicles. Most of the available urban scene understanding methods use red …
CACFNet: Cross-modal attention cascaded fusion network for RGB-T urban scene parsing
Color–thermal (RGB-T) urban scene parsing has recently attracted widespread interest.
However, most existing approaches to RGB-T urban scene parsing do not deeply explore …
However, most existing approaches to RGB-T urban scene parsing do not deeply explore …
DBCNet: Dynamic bilateral cross-fusion network for RGB-T urban scene understanding in intelligent vehicles
Understanding urban scenes is a fundamental capability required of intelligent vehicles.
Depth cues provide useful geometric information for semantic segmentation, thus …
Depth cues provide useful geometric information for semantic segmentation, thus …
MFFENet: Multiscale feature fusion and enhancement network for RGB–thermal urban road scene parsing
Compared with traditional handcrafted features, deep learning has greatly improved the
performance of scene parsing. However, it remains challenging under various …
performance of scene parsing. However, it remains challenging under various …