Image segmentation using deep learning: A survey
Image segmentation is a key task in computer vision and image processing with important
applications such as scene understanding, medical image analysis, robotic perception …
applications such as scene understanding, medical image analysis, robotic perception …
BIM, machine learning and computer vision techniques in underground construction: Current status and future perspectives
The architecture, engineering and construction (AEC) industry is experiencing a
technological revolution driven by booming digitisation and automation. Advances in …
technological revolution driven by booming digitisation and automation. Advances in …
Openoccupancy: A large scale benchmark for surrounding semantic occupancy perception
Semantic occupancy perception is essential for autonomous driving, as automated vehicles
require a fine-grained perception of the 3D urban structures. However, existing relevant …
require a fine-grained perception of the 3D urban structures. However, existing relevant …
Kitti-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d
For the last few decades, several major subfields of artificial intelligence including computer
vision, graphics, and robotics have progressed largely independently from each other …
vision, graphics, and robotics have progressed largely independently from each other …
Mvimgnet: A large-scale dataset of multi-view images
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …
Regtr: End-to-end point cloud correspondences with transformers
Despite recent success in incorporating learning into point cloud registration, many works
focus on learning feature descriptors and continue to rely on nearest-neighbor feature …
focus on learning feature descriptors and continue to rely on nearest-neighbor feature …
Unifying flow, stereo and depth estimation
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …
Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives
Abstract We present Ego-Exo4D a diverse large-scale multimodal multiview video dataset
and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric …
and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric …
Adabins: Depth estimation using adaptive bins
We address the problem of estimating a high quality dense depth map from a single RGB
input image. We start out with a baseline encoder-decoder convolutional neural network …
input image. We start out with a baseline encoder-decoder convolutional neural network …
Predator: Registration of 3d point clouds with low overlap
We introduce PREDATOR, a model for pairwise pointcloud registration with deep attention
to the overlap region. Different from previous work, our model is specifically designed to …
to the overlap region. Different from previous work, our model is specifically designed to …