HigherHRNet: Scale-aware representation learning for bottom-up human pose estimation

Deep learning-based human pose estimation: A survey

C Zheng, W Wu, C Chen, T Yang, S Zhu, J Shen… - ACM Computing …, 2023 - dl.acm.org

Human pose estimation aims to locate the human body parts and build human body
representation (e.g., body skeleton) from input data such as images and videos. It has drawn …

Cited by 477 - Related articles - All 4 versions

[PDF] springer.com

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

L Alzubaidi, J Zhang, AJ Humaidi, A Al-Dujaili… - Journal of Big Data, 2021 - Springer

In the last few years, the deep learning (DL) computing paradigm has been deemed the
Gold Standard in the machine learning (ML) community. Moreover, it has gradually become …

Cited by 5519 - Related articles - All 18 versions

[PDF] arxiv.org

AlphaPose: Whole-body regional multi-person pose estimation and tracking in real-time

HS Fang, J Li, H Tang, C Xu, H Zhu… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org

Accurate whole-body multi-person pose estimation and tracking is an important yet
challenging topic in computer vision. To capture the subtle actions of humans for complex …

Cited by 436 - Related articles - All 8 versions

[PDF] arxiv.org

DS-TransUNet: Dual Swin Transformer U-Net for medical image segmentation

A Lin, B Chen, J Xu, Z Zhang, G Lu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Automatic medical image segmentation has made great progress owing to powerful deep
representation learning. Inspired by the success of self-attention mechanism in transformer …

Cited by 610 - Related articles - All 5 versions

[PDF] thecvf.com

YOLO-Pose: Enhancing YOLO for multi person pose estimation using object keypoint similarity loss

D Maji, S Nagori, M Mathew… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

We introduce YOLO-pose, a novel heatmap-free approach for joint detection, and 2D multi-
person pose estimation in an image based on the popular YOLO object detection …

Cited by 235 - Related articles - All 6 versions

[PDF] thecvf.com

CrossViT: Cross-attention multi-scale vision transformer for image classification

CFR Chen, Q Fan, R Panda - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

The recently developed vision transformer (ViT) has achieved promising results on image
classification compared to convolutional neural networks. Inspired by this, in this paper, we …

Cited by 1572 - Related articles - All 9 versions

[PDF] thecvf.com

Revisiting skeleton-based action recognition

H Duan, Y Zhao, K Chen, D Lin… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Human skeleton, as a compact representation of human action, has received increasing
attention in recent years. Many skeleton-based action recognition methods adopt GCNs to …

Cited by 619 - Related articles - All 7 versions

[PDF] thecvf.com

Lite-HRNet: A lightweight high-resolution network

C Yu, B Xiao, C Gao, L Yuan, L Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present an efficient high-resolution network, Lite-HRNet, for human pose estimation. We
start by simply applying the efficient shuffle block in ShuffleNet to HRNet (high-resolution …

Cited by 411 - Related articles - All 7 versions

[PDF] arxiv.org

NeuMan: Neural human radiance field from a single video

W Jiang, KM Yi, G Samei, O Tuzel, A Ranjan - European Conference on …, 2022 - Springer

Photorealistic rendering and reposing of humans is important for enabling augmented reality
experiences. We propose a novel framework to reconstruct the human and the scene that …

Cited by 176 - Related articles - All 5 versions

[PDF] nature.com

Multi-animal pose estimation, identification and tracking with DeepLabCut

J Lauer, M Zhou, S Ye, W Menegas, S Schneider… - Nature …, 2022 - nature.com

Estimating the pose of multiple animals is a challenging computer vision problem: frequent
interactions cause occlusions and complicate the association of detected keypoints to the …

Cited by 313 - Related articles - All 11 versions