Recent advances of monocular 2d and 3d human pose estimation: A deep learning perspective

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org
Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

Human pose estimation and its application to action recognition: A survey

L Song, G Yu, J Yuan, Z Liu - Journal of Visual Communication and Image …, 2021 - Elsevier
Human pose estimation aims at predicting the poses of human body parts in images or
videos. Since pose motions are often driven by some specific human actions, knowing the …

Alphapose: Whole-body regional multi-person pose estimation and tracking in real-time

HS Fang, J Li, H Tang, C Xu, H Zhu… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Accurate whole-body multi-person pose estimation and tracking is an important yet
challenging topic in computer vision. To capture the subtle actions of humans for complex …

Effective whole-body pose estimation with two-stages distillation

Z Yang, A Zeng, C Yuan, Y Li - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Whole-body pose estimation localizes the human body, hand, face, and foot keypoints in an
image. This task is challenging due to multi-scale body parts, fine-grained localization for …

Capturing and inferring dense full-body human-scene contact

CHP Huang, H Yi, M Höschle… - Proceedings of the …, 2022 - openaccess.thecvf.com
Inferring human-scene contact (HSC) is the first step toward understanding how humans
interact with their surroundings. While detecting 2D human-object interaction (HOI) and …

Not all tokens are equal: Human-centric visual analysis via token clustering transformer

W Zeng, S Jin, W Liu, C Qian, P Luo… - Proceedings of the …, 2022 - openaccess.thecvf.com
Vision transformers have achieved great successes in many computer vision tasks. Most
methods generate vision tokens by splitting an image into a regular and fixed grid and …

Monocular expressive body regression through body-driven attention

V Choutas, G Pavlakos, T Bolkart, D Tzionas… - Computer Vision–ECCV …, 2020 - Springer
To understand how people look, interact, or perform tasks, we need to quickly and
accurately capture their 3D body, face, and hands together from an RGB image. Most …

Whole-body human pose estimation in the wild

S Jin, L Xu, J Xu, C Wang, W Liu, C Qian… - Computer Vision–ECCV …, 2020 - Springer
This paper investigates the task of 2D human whole-body pose estimation, which aims to
localize dense landmarks on the entire human body including face, hands, body, and feet …

Rtmpose: Real-time multi-person pose estimation based on mmpose

T Jiang, P Lu, L Zhang, N Ma, R Han, C Lyu… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent studies on 2D pose estimation have achieved excellent performance on public
benchmarks, yet its application in the industrial community still suffers from heavy model …

Multi-channel transformers for multi-articulatory sign language translation

NC Camgoz, O Koller, S Hadfield… - Computer Vision–ECCV …, 2020 - Springer
Sign languages use multiple asynchronous information channels (articulators), not just the
hands but also the face and body, which computational approaches often ignore. In this …