Recovering 3d human mesh from monocular images: A survey
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
State of the art on diffusion models for visual computing
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …
Effective whole-body pose estimation with two-stages distillation
Whole-body pose estimation localizes the human body, hand, face, and foot keypoints in an
image. This task is challenging due to multi-scale body parts, fine-grained localization for …
image. This task is challenging due to multi-scale body parts, fine-grained localization for …
Grounded sam: Assembling open-world models for diverse visual tasks
We introduce Grounded SAM, which uses Grounding DINO as an open-set object detector to
combine with the segment anything model (SAM). This integration enables the detection and …
combine with the segment anything model (SAM). This integration enables the detection and …
Humanmac: Masked motion completion for human motion prediction
Human motion prediction is a classical problem in computer vision and computer graphics,
which has a wide range of practical applications. Previous effects achieve great empirical …
which has a wide range of practical applications. Previous effects achieve great empirical …
Miradata: A large-scale video dataset with long durations and structured captions
Sora's high-motion intensity and long consistent videos have significantly impacted the field
of video generation, attracting unprecedented attention. However, existing publicly available …
of video generation, attracting unprecedented attention. However, existing publicly available …
Large motion model for unified multi-modal motion generation
Human motion generation, a cornerstone technique in animation and video production, has
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …
360-degree Human Video Generation with 4D Diffusion Transformer
We present a novel approach for generating 360-degree high-quality, spatiotemporally
coherent human videos from a single image. Our framework combines the strengths of …
coherent human videos from a single image. Our framework combines the strengths of …
Imugpt 2.0: Language-based cross modality transfer for sensor-based human activity recognition
One of the primary challenges in the field of human activity recognition (HAR) is the lack of
large labeled datasets. This hinders the development of robust and generalizable models …
large labeled datasets. This hinders the development of robust and generalizable models …
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents
Traditional approaches in physics-based motion generation centered around imitation
learning and reward shaping often struggle to adapt to new scenarios. To tackle this …
learning and reward shaping often struggle to adapt to new scenarios. To tackle this …