Unsupervised scale-consistent depth and ego-motion learning from monocular video
Recent work has shown that CNN-based depth and ego-motion estimators can be learned
using unlabelled monocular videos. However, the performance is limited by unidentified …
using unlabelled monocular videos. However, the performance is limited by unidentified …
Transformer-based attention networks for continuous pixel-wise prediction
While convolutional neural networks have shown a tremendous impact on various computer
vision tasks, they generally demonstrate limitations in explicitly modeling long-range …
vision tasks, they generally demonstrate limitations in explicitly modeling long-range …
Perception and navigation in autonomous systems in the era of learning: A survey
Autonomous systems possess the features of inferring their own state, understanding their
surroundings, and performing autonomous navigation. With the applications of learning …
surroundings, and performing autonomous navigation. With the applications of learning …
Monoindoor: Towards good practice of self-supervised monocular depth estimation for indoor environments
Self-supervised depth estimation for indoor environments is more challenging than its
outdoor counterpart in at least the following two aspects:(i) the depth range of indoor …
outdoor counterpart in at least the following two aspects:(i) the depth range of indoor …
Map-free visual relocalization: Metric pose relative to a single image
Can we relocalize in a scene represented by a single reference image? Standard visual
relocalization requires hundreds of images and scale calibration to build a scene-specific …
relocalization requires hundreds of images and scale calibration to build a scene-specific …
Edge-SLAM: Edge-assisted visual simultaneous localization and mapping
Localization in urban environments is becoming increasingly important and used in tools
such as ARCore, ARKit and others. One popular mechanism to achieve accurate indoor …
such as ARCore, ARKit and others. One popular mechanism to achieve accurate indoor …
Learning monocular visual odometry via self-supervised long-term modeling
Monocular visual odometry (VO) suffers severely from error accumulation during frame-to-
frame pose estimation. In this paper, we present a self-supervised learning method for VO …
frame pose estimation. In this paper, we present a self-supervised learning method for VO …
Planemvs: 3d plane reconstruction from multi-view stereo
We present a novel framework named PlaneMVS for 3D plane reconstruction from multiple
input views with known camera poses. Most previous learning-based plane reconstruction …
input views with known camera poses. Most previous learning-based plane reconstruction …
Can scale-consistent monocular depth be learned in a self-supervised scale-invariant manner?
Geometric constraints are shown to enforce scale consistency and remedy the scale
ambiguity issue in self-supervised monocular depth estimation. Meanwhile, scale-invariant …
ambiguity issue in self-supervised monocular depth estimation. Meanwhile, scale-invariant …
Adversarial training of self-supervised monocular depth estimation against physical-world attacks
Monocular Depth Estimation (MDE) is a critical component in applications such as
autonomous driving. There are various attacks against MDE networks. These attacks …
autonomous driving. There are various attacks against MDE networks. These attacks …