Computer vision applications in construction: Current state, opportunities & challenges

S Paneru, I Jeelani - Automation in Construction, 2021 - Elsevier
Thousands of images and videos are collected from construction projects during
construction. These contain valuable data that, if harnessed efficiently, can help automate or …

Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Block-nerf: Scalable large scene neural view synthesis

M Tancik, V Casser, X Yan, S Pradhan… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We present Block-NeRF, a variant of Neural Radiance Fields that can represent
large-scale environments. Specifically, we demonstrate that when scaling NeRF to render …

Grid-guided neural radiance fields for large urban scenes

L Xu, Y Xiangli, S Peng, X Pan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Purely MLP-based neural radiance fields (NeRF-based methods) often suffer from
underfitting with blurred renderings on large-scale scenes due to limited model capacity …

Vastgaussian: Vast 3d gaussians for large scene reconstruction

J Lin, Z Li, X Tang, J Liu, S Liu, J Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Existing NeRF-based methods for large scene reconstruction often have limitations in visual
quality and rendering speed. While the recent 3D Gaussian Splatting works well on small …

Tanks and temples: Benchmarking large-scale scene reconstruction

A Knapitsch, J Park, QY Zhou, V Koltun - ACM Transactions on Graphics …, 2017 - dl.acm.org
We present a benchmark for image-based 3D reconstruction. The benchmark sequences
were acquired outside the lab, in realistic conditions. Ground-truth data was captured using …

Structure-from-motion revisited

JL Schonberger, JM Frahm - … of the IEEE conference on computer …, 2016 - cv-foundation.org
Abstract Incremental Structure-from-Motion is a prevalent strategy for 3D reconstruction from
unordered image collections. While incremental reconstruction systems have tremendously …

ROSEFusion: random optimization for online dense reconstruction under fast camera motion

J Zhang, C Zhu, L Zheng, K Xu - ACM Transactions on Graphics (TOG), 2021 - dl.acm.org
Online reconstruction based on RGB-D sequences has thus far been restrained to relatively
slow camera motions (< 1m/s). Under very fast camera motion (eg, 3m/s), the reconstruction …

Learning a multi-view stereo machine

A Kar, C Häne, J Malik - Advances in neural information …, 2017 - proceedings.neurips.cc
We present a learnt system for multi-view stereopsis. In contrast to recent learning based
methods for 3D reconstruction, we leverage the underlying 3D geometry of the problem …

Compressed video action recognition

CY Wu, M Zaheer, H Hu, R Manmatha… - Proceedings of the …, 2018 - openaccess.thecvf.com
Training robust deep video representations has proven to be much more challenging than
learning deep image representations. This is in part due to the enormous size of raw video …