Single-network whole-body pose estimation

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org

Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

被引用次数：166 相关文章所有 5 个版本

Human pose estimation and its application to action recognition: A survey

L Song, G Yu, J Yuan, Z Liu - Journal of Visual Communication and Image …, 2021 - Elsevier

Human pose estimation aims at predicting the poses of human body parts in images or
videos. Since pose motions are often driven by some specific human actions, knowing the …

被引用次数：126 相关文章所有 2 个版本

[PDF] arxiv.org

Alphapose: Whole-body regional multi-person pose estimation and tracking in real-time

HS Fang, J Li, H Tang, C Xu, H Zhu… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org

Accurate whole-body multi-person pose estimation and tracking is an important yet
challenging topic in computer vision. To capture the subtle actions of humans for complex …

被引用次数：355 相关文章所有 8 个版本

[PDF] thecvf.com

Effective whole-body pose estimation with two-stages distillation

Z Yang, A Zeng, C Yuan, Y Li - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Whole-body pose estimation localizes the human body, hand, face, and foot keypoints in an
image. This task is challenging due to multi-scale body parts, fine-grained localization for …

被引用次数：63 相关文章所有 5 个版本

[PDF] thecvf.com

Capturing and inferring dense full-body human-scene contact

CHP Huang, H Yi, M Höschle… - Proceedings of the …, 2022 - openaccess.thecvf.com

Inferring human-scene contact (HSC) is the first step toward understanding how humans
interact with their surroundings. While detecting 2D human-object interaction (HOI) and …

被引用次数：104 相关文章所有 6 个版本

[PDF] thecvf.com

Not all tokens are equal: Human-centric visual analysis via token clustering transformer

W Zeng, S Jin, W Liu, C Qian, P Luo… - Proceedings of the …, 2022 - openaccess.thecvf.com

Vision transformers have achieved great successes in many computer vision tasks. Most
methods generate vision tokens by splitting an image into a regular and fixed grid and …

被引用次数：107 相关文章所有 7 个版本

Monocular expressive body regression through body-driven attention

V Choutas, G Pavlakos, T Bolkart, D Tzionas… - Computer Vision–ECCV …, 2020 - Springer

To understand how people look, interact, or perform tasks, we need to quickly and
accurately capture their 3D body, face, and hands together from an RGB image. Most …

被引用次数：237 相关文章所有 5 个版本

[PDF] arxiv.org

Whole-body human pose estimation in the wild

S Jin, L Xu, J Xu, C Wang, W Liu, C Qian… - Computer Vision–ECCV …, 2020 - Springer

This paper investigates the task of 2D human whole-body pose estimation, which aims to
localize dense landmarks on the entire human body including face, hands, body, and feet …

被引用次数：243 相关文章所有 8 个版本

[PDF] arxiv.org

Rtmpose: Real-time multi-person pose estimation based on mmpose

T Jiang, P Lu, L Zhang, N Ma, R Han, C Lyu… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent studies on 2D pose estimation have achieved excellent performance on public
benchmarks, yet its application in the industrial community still suffers from heavy model …

被引用次数：76 相关文章所有 2 个版本

[PDF] arxiv.org

Multi-channel transformers for multi-articulatory sign language translation

NC Camgoz, O Koller, S Hadfield… - Computer Vision–ECCV …, 2020 - Springer

Sign languages use multiple asynchronous information channels (articulators), not just the
hands but also the face and body, which computational approaches often ignore. In this …

被引用次数：152 相关文章所有 10 个版本