Deep learning-based human pose estimation: A survey
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
Efficientvit: Lightweight multi-scale attention for high-resolution dense prediction
High-resolution dense prediction enables many appealing real-world applications, such as
computational photography, autonomous driving, etc. However, the vast computational cost …
computational photography, autonomous driving, etc. However, the vast computational cost …
Human pose as compositional tokens
Human pose is typically represented by a coordinate vector of body joints or their heatmap
embeddings. While easy for data processing, unrealistic pose estimates are admitted due to …
embeddings. While easy for data processing, unrealistic pose estimates are admitted due to …
Ppt: token-pruned pose transformer for monocular and multi-view human pose estimation
Recently, the vision transformer and its variants have played an increasingly important role
in both monocular and multi-view human pose estimation. Considering image patches as …
in both monocular and multi-view human pose estimation. Considering image patches as …
[HTML][HTML] An enhanced real-time human pose estimation method based on modified YOLOv8 framework
C Dong, G Du - Scientific Reports, 2024 - nature.com
The objective of human pose estimation (HPE) derived from deep learning aims to
accurately estimate and predict the human body posture in images or videos via the …
accurately estimate and predict the human body posture in images or videos via the …
Efficient skeleton-based action recognition via joint-mapping strategies
MS Kang, D Kang, HS Kim - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Graph convolutional networks (GCNs) have brought remarkable progress in skeleton-based
action recognition. However, high computational cost and large model size make models …
action recognition. However, high computational cost and large model size make models …
Fast CNN-based single-person 2D human pose estimation for autonomous systems
C Papaioannidis, I Mademlis… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
This paper presents a novel Convolutional Neural Network (CNN) architecture for 2D human
pose estimation from RGB images that balances between high 2D human pose/skeleton …
pose estimation from RGB images that balances between high 2D human pose/skeleton …
Transformer-based rapid human pose estimation network
D Wang, W Xie, Y Cai, X Li, X Liu - Computers & Graphics, 2023 - Elsevier
Most current human pose estimation methods pursue excellent performance via large
models and intensive computational requirements, resulting in slower models. These …
models and intensive computational requirements, resulting in slower models. These …
JAWS: just a wild shot for cinematic transfer in neural radiance fields
This paper presents JAWS, an optimzation-driven approach that achieves the robust transfer
of visual cinematic features from a reference in-the-wild video clip to a newly generated clip …
of visual cinematic features from a reference in-the-wild video clip to a newly generated clip …
[PDF][PDF] EfficientViT: Lightweight multi-scale attention for on-device semantic segmentation
Semantic segmentation enables many appealing realworld applications, such as
computational photography, autonomous driving, etc. However, the vast computational cost …
computational photography, autonomous driving, etc. However, the vast computational cost …