Monocular depth estimation using deep learning: A review

A Masoumian, HA Rashwan, J Cristiano, MS Asif… - Sensors, 2022 - mdpi.com
In current decades, significant advancements in robotics engineering and autonomous
vehicles have improved the requirement for precise depth measurements. Depth estimation …

Joint learning of frequency and spatial domains for dense image prediction

S Jia, W Yao - ISPRS Journal of Photogrammetry and Remote …, 2023 - Elsevier
Current artificial neural networks mainly conduct the learning process in the spatial domain
but neglect the frequency domain learning. However, the learning course performed in the …

DMRVisNet: Deep multihead regression network for pixel-wise visibility estimation under foggy weather

J You, S Jia, X Pei, D Yao - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org
Scene perception is essential for driving decision-making and traffic safety. However, fog, as
a kind of common weather, frequently appears in the real world, especially in mountain …

Rtia-mono: real-time lightweight self-supervised monocular depth estimation with global-local information aggregation

B Zhao, H He, H Xu, P Shi, X Hao, G Huang - Digital Signal Processing, 2025 - Elsevier
Self-supervised monocular depth estimation has attracted significant attention in computer
vision, especially for applications like autonomous driving and robotics. Recently, CNNs and …

LDA-Mono: A lightweight dual aggregation network for self-supervised monocular depth estimation

B Zhao, H He, H Xu, P Shi, X Hao, G Huang - Knowledge-Based Systems, 2024 - Elsevier
Monocular depth estimation plays a crucial role in various computer vision and robotics
applications, particularly in self-supervised methods that do not require ground-truth depth …

DERNet: driver emotion recognition using onboard camera

D Wang, S Jia, X Pei, C Han, D Yao… - IEEE Intelligent …, 2023 - ieeexplore.ieee.org
Driver emotion is considered an essential factor associated with driving behaviors and thus
influences traffic safety. Dynamically and accurately recognizing the emotions of drivers …

GFA-SMT: Geometric Feature Aggregation and Self-Attention in a Multi-Head Transformer for 3D Object Detection in Autonomous Vehicles

H Mushtaq, X Deng, P Jiang, S Wan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
3D object detection by autonomous vehicles is integral to intelligent transportation. Existing
systems often compromise essential foreground point features and local spatial interactions …

[HTML][HTML] Self-supervised multi-task learning framework for safety and health-oriented road environment surveillance based on connected vehicle visual perception

S Jia, W Yao - International Journal of Applied Earth Observation and …, 2024 - Elsevier
Cutting-edge connected vehicle (CV) technologies have drawn much attention in recent
years. The real-time traffic data captured by a CV can be shared with other CVs and data …

Rebalancing gradient to improve self-supervised co-training of depth, odometry and optical flow predictions

M Hariat, A Manzanera, D Filliat - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We present CoopNet, an approach that improves the cooperation of co-trained networks by
dynamically adapting the apportionment of gradient, to ensure equitable learning progress …

Self-Supervised Monocular Depth Estimation via Binocular Geometric Correlation Learning

B Peng, L Sun, J Lei, B Liu, H Shen, W Li… - ACM Transactions on …, 2024 - dl.acm.org
Monocular depth estimation aims to infer a depth map from a single image. Although
supervised learning-based methods have achieved remarkable performance, they generally …