PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video

D Wu, Z Yan, H Zha - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We introduce the Panoptic 3D Reconstruction task a unified and holistic scene
understanding task for a monocular video. And we present PanoRecon-a novel framework …

Towards Energy-Efficiency by Navigating the Trilemma of Energy, Latency, and Accuracy

B Tian, Y Pang, M Huzaifa, S Wang… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Extended Reality (XR) enables immersive experiences through untethered headsets but
suffers from stringent battery and resource constraints. Energy-efficient design is crucial to …

MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling

J Ahn, H Choi, S Kim, D Min - arXiv preprint arXiv:2409.02846, 2024 - arxiv.org
In stereo matching, CNNs have traditionally served as the predominant architectures.
Although Transformer-based stereo models have been studied recently, their performance …

Ray-Distance Volume Rendering for Neural Scene Reconstruction

R Yin, Y Chen, S Karaoglu, T Gevers - European Conference on Computer …, 2025 - Springer
Existing methods in neural scene reconstruction utilize the Signed Distance Function (SDF)
to model the density function. However, in indoor scenes, the density computed from the …

FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos

F Langer, J Ju, G Dikov, G Reitmayr… - European Conference on …, 2025 - Springer
Digitising the 3D world into a clean, CAD model-based representation has important
applications for augmented reality and robotics. Current state-of-the-art methods are …

Real-Time 3D Visual Perception by Cross-Dimensional Refined Learning

Z Hong, CP Yue - IEEE Transactions on Circuits and Systems …, 2024 - ieeexplore.ieee.org
We introduce a novel learning method that can effectively perceive both the geometry
structure and semantic labels of a 3D scene in real time. Existing real-time 3D scene …

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

M Hu, W Yin, C Zhang, Z Cai, X Long, H Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Metric3D v2, a geometric foundation model for zero-shot metric depth and
surface normal estimation from a single image, which is crucial for metric 3D recovery. While …

EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels

Q Tian, Z Chen, H Liao, X Huang, L Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Single-image depth estimation is essential for endoscopy tasks such as localization,
reconstruction, and augmented reality. Most existing methods in surgical scenes focus on in …

The Devil is in the Details: Boosting Guided Depth Super-Resolution via Rethinking Cross-Modal Alignment and Aggregation

X Jiang, Z Kuang, C Guo, R Zhang, L Cai… - arXiv preprint arXiv …, 2024 - arxiv.org
Guided depth super-resolution (GDSR) involves restoring missing depth details using the
high-resolution RGB image of the same scene. Previous approaches have struggled with …

PointRecon: Online Point-based 3D Reconstruction via Ray-based 2D-3D Matching

C Ziwen, Z Xu, L Fuxin - arXiv preprint arXiv:2410.23245, 2024 - arxiv.org
We propose a novel online, point-based 3D reconstruction method from posed monocular
RGB videos. Our model maintains a global point cloud representation of the scene …