Indoor scene understanding in 2.5/3D for autonomous agents: A survey

M Naseer, S Khan, F Porikli - IEEE Access, 2018 - ieeexplore.ieee.org
With the availability of low-cost and compact 2.5/3D visual sensing devices, the computer vision
community is experiencing a growing interest in visual scene understanding of indoor …

Learning rich features from RGB-D images for object detection and segmentation

S Gupta, R Girshick, P Arbeláez, J Malik - … 6-12, 2014, Proceedings, Part VII …, 2014 - Springer
In this paper we study the problem of object detection for RGB-D images using semantically
rich image and depth features. We propose a new geocentric embedding for depth images …

MLCVNet: Multi-level context VoteNet for 3D object detection

Q Xie, YK Lai, J Wu, Z Wang… - Proceedings of the …, 2020 - openaccess.thecvf.com
In this paper, we address the 3D object detection task by capturing multi-level contextual
information with the self-attention mechanism and multi-scale feature fusion. Most existing …

Towards viewpoint invariant 3D human pose estimation

A Haque, B Peng, Z Luo, A Alahi, S Yeung… - Computer Vision–ECCV …, 2016 - Springer
We propose a viewpoint invariant model for 3D human pose estimation from a single depth
image. To achieve this, our discriminative model embeds local regions into a learned …

Real-time monocular object SLAM

D Gálvez-López, M Salas, JD Tardós… - Robotics and Autonomous …, 2016 - Elsevier
We present a real-time object-based SLAM system that leverages the largest object
database to date. Our approach comprises two main components: (1) a monocular SLAM …

Occlusion reasoning for object detection under arbitrary viewpoint

E Hsiao, M Hebert - IEEE transactions on pattern analysis and …, 2014 - ieeexplore.ieee.org
We present a unified occlusion model for object instance detection under arbitrary viewpoint.
Whereas previous approaches primarily modeled local coherency of occlusions or …

Occlusion-aware hand pose estimation using hierarchical mixture density network

Q Ye, TK Kim - Proceedings of the European conference on …, 2018 - openaccess.thecvf.com
Learning and predicting the pose parameters of a 3D hand model given an image, such as
locations of hand joints, is challenging due to large viewpoint changes and articulations, and …

Actionness ranking with lattice conditional ordinal random fields

W Chen, C Xiong, R Xu… - Proceedings of the IEEE …, 2014 - openaccess.thecvf.com
Action analysis in image and video has been attracting more and more attention in computer
vision. Recognizing specific actions in video clips has been the main focus. We move in a …

Latent-class Hough forests for 6 DoF object pose estimation

A Tejani, R Kouskouridas… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
In this paper we present Latent-Class Hough Forests, a method for object detection and 6
DoF pose estimation in heavily cluttered and occluded scenarios. We adapt a state of the art …

Sieving regression forest votes for facial feature detection in the wild

H Yang, I Patras - … of the IEEE International Conference on …, 2013 - openaccess.thecvf.com
In this paper we propose a method for the localization of multiple facial features on
challenging face images. In the regression forests (RF) framework, observations (patches) …