Deep learning-based human pose estimation: A survey

C Zheng, W Wu, C Chen, T Yang, S Zhu, J Shen… - ACM Computing …, 2023 - dl.acm.org
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

L Alzubaidi, J Zhang, AJ Humaidi, A Al-Dujaili… - Journal of big Data, 2021 - Springer
In the last few years, the deep learning (DL) computing paradigm has been deemed the
Gold Standard in the machine learning (ML) community. Moreover, it has gradually become …

Alphapose: Whole-body regional multi-person pose estimation and tracking in real-time

HS Fang, J Li, H Tang, C Xu, H Zhu… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Accurate whole-body multi-person pose estimation and tracking is an important yet
challenging topic in computer vision. To capture the subtle actions of humans for complex …

Ds-transunet: Dual swin transformer u-net for medical image segmentation

A Lin, B Chen, J Xu, Z Zhang, G Lu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Automatic medical image segmentation has made great progress owing to powerful deep
representation learning. Inspired by the success of self-attention mechanism in transformer …

Yolo-pose: Enhancing yolo for multi person pose estimation using object keypoint similarity loss

D Maji, S Nagori, M Mathew… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We introduce YOLO-pose, a novel heatmap-free approach for joint detection, and 2D multi-
person pose estimation in an image based on the popular YOLO object detection …

Crossvit: Cross-attention multi-scale vision transformer for image classification

CFR Chen, Q Fan, R Panda - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
The recently developed vision transformer (ViT) has achieved promising results on image
classification compared to convolutional neural networks. Inspired by this, in this paper, we …

Revisiting skeleton-based action recognition

H Duan, Y Zhao, K Chen, D Lin… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Human skeleton, as a compact representation of human action, has received increasing
attention in recent years. Many skeleton-based action recognition methods adopt GCNs to …

Lite-hrnet: A lightweight high-resolution network

C Yu, B Xiao, C Gao, L Yuan, L Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present an efficient high-resolution network, Lite-HRNet, for human pose estimation. We
start by simply applying the efficient shuffle block in ShuffleNet to HRNet (high-resolution …

Neuman: Neural human radiance field from a single video

W Jiang, KM Yi, G Samei, O Tuzel, A Ranjan - European Conference on …, 2022 - Springer
Photorealistic rendering and reposing of humans is important for enabling augmented reality
experiences. We propose a novel framework to reconstruct the human and the scene that …

Multi-animal pose estimation, identification and tracking with DeepLabCut

J Lauer, M Zhou, S Ye, W Menegas, S Schneider… - Nature …, 2022 - nature.com
Estimating the pose of multiple animals is a challenging computer vision problem: frequent
interactions cause occlusions and complicate the association of detected keypoints to the …