Review the state-of-the-art technologies of semantic segmentation based on deep learning

Y Mo, Y Wu, X Yang, F Liu, Y Liao - Neurocomputing, 2022 - Elsevier
The goal of semantic segmentation is to segment the input image according to semantic
information and predict the semantic category of each pixel from a given label set. With the …

RGB-T image analysis technology and application: A survey

K Song, Y Zhao, L Huang, Y Yan, Q Meng - Engineering Applications of …, 2023 - Elsevier
Abstract RGB-Thermal infrared (RGB-T) image analysis has been actively studied in recent
years. In the past decade, it has received wide attention and made a lot of important …

GMNet: Graded-feature multilabel-learning network for RGB-thermal urban scene semantic segmentation

W Zhou, J Liu, J Lei, L Yu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Semantic segmentation is a fundamental task in computer vision, and it has various
applications in fields such as robotic sensing, video surveillance, and autonomous driving. A …

CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers

J Zhang, H Liu, K Yang, X Hu, R Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …

Delivering arbitrary-modal semantic segmentation

J Zhang, R Liu, H Shi, K Yang, S Reiß… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multimodal fusion can make semantic segmentation more robust. However, fusing an
arbitrary number of modalities remains underexplored. To delve into this problem, we create …

RGB-T semantic segmentation with location, activation, and sharpening

G Li, Y Wang, Z Liu, X Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Semantic segmentation is important for scene understanding. To address the scenes of
adverse illumination conditions of natural images, thermal infrared (TIR) images are …

MTANet: Multitask-aware network with hierarchical multimodal fusion for RGB-T urban scene understanding

W Zhou, S Dong, J Lei, L Yu - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org
Understanding urban scenes is a fundamental ability requirement for assisted driving and
autonomous vehicles. Most of the available urban scene understanding methods use red …

CACFNet: Cross-modal attention cascaded fusion network for RGB-T urban scene parsing

W Zhou, S Dong, M Fang, L Yu - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Color–thermal (RGB-T) urban scene parsing has recently attracted widespread interest.
However, most existing approaches to RGB-T urban scene parsing do not deeply explore …

DBCNet: Dynamic bilateral cross-fusion network for RGB-T urban scene understanding in intelligent vehicles

W Zhou, T Gong, J Lei, L Yu - IEEE Transactions on Systems …, 2023 - ieeexplore.ieee.org
Understanding urban scenes is a fundamental capability required of intelligent vehicles.
Depth cues provide useful geometric information for semantic segmentation, thus …

MFFENet: Multiscale feature fusion and enhancement network for RGB–thermal urban road scene parsing

W Zhou, X Lin, J Lei, L Yu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Compared with traditional handcrafted features, deep learning has greatly improved the
performance of scene parsing. However, it remains challenging under various …