Unsupervised scale-consistent depth and ego-motion learning from monocular video

J Bian, Z Li, N Wang, H Zhan, C Shen… - Advances in neural …, 2019 - proceedings.neurips.cc
Recent work has shown that CNN-based depth and ego-motion estimators can be learned
using unlabelled monocular videos. However, the performance is limited by unidentified …

Transformer-based attention networks for continuous pixel-wise prediction

G Yang, H Tang, M Ding, N Sebe… - Proceedings of the …, 2021 - openaccess.thecvf.com
While convolutional neural networks have shown a tremendous impact on various computer
vision tasks, they generally demonstrate limitations in explicitly modeling long-range …

Perception and navigation in autonomous systems in the era of learning: A survey

Y Tang, C Zhao, J Wang, C Zhang… - … on Neural Networks …, 2022 - ieeexplore.ieee.org
Autonomous systems possess the features of inferring their own state, understanding their
surroundings, and performing autonomous navigation. With the applications of learning …

Monoindoor: Towards good practice of self-supervised monocular depth estimation for indoor environments

P Ji, R Li, B Bhanu, Y Xu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Self-supervised depth estimation for indoor environments is more challenging than its
outdoor counterpart in at least the following two aspects:(i) the depth range of indoor …

Map-free visual relocalization: Metric pose relative to a single image

E Arnold, J Wynn, S Vicente… - … on Computer Vision, 2022 - Springer
Can we relocalize in a scene represented by a single reference image? Standard visual
relocalization requires hundreds of images and scale calibration to build a scene-specific …

Edge-SLAM: Edge-assisted visual simultaneous localization and mapping

AJ Ben Ali, M Kouroshli, S Semenova… - ACM Transactions on …, 2022 - dl.acm.org
Localization in urban environments is becoming increasingly important and used in tools
such as ARCore, ARKit and others. One popular mechanism to achieve accurate indoor …

Learning monocular visual odometry via self-supervised long-term modeling

Y Zou, P Ji, QH Tran, JB Huang… - European Conference on …, 2020 - Springer
Monocular visual odometry (VO) suffers severely from error accumulation during frame-to-
frame pose estimation. In this paper, we present a self-supervised learning method for VO …

Planemvs: 3d plane reconstruction from multi-view stereo

J Liu, P Ji, N Bansal, C Cai, Q Yan… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present a novel framework named PlaneMVS for 3D plane reconstruction from multiple
input views with known camera poses. Most previous learning-based plane reconstruction …

Can scale-consistent monocular depth be learned in a self-supervised scale-invariant manner?

L Wang, Y Wang, L Wang, Y Zhan… - Proceedings of the …, 2021 - openaccess.thecvf.com
Geometric constraints are shown to enforce scale consistency and remedy the scale
ambiguity issue in self-supervised monocular depth estimation. Meanwhile, scale-invariant …

Adversarial training of self-supervised monocular depth estimation against physical-world attacks

Z Cheng, J Liang, G Tao, D Liu, X Zhang - arXiv preprint arXiv:2301.13487, 2023 - arxiv.org
Monocular Depth Estimation (MDE) is a critical component in applications such as
autonomous driving. There are various attacks against MDE networks. These attacks …