A Comprehensive Review of Data‐Driven Co‐Speech Gesture Generation
Gestures that accompany speech are an essential part of natural and efficient embodied
human communication. The automatic generation of such co‐speech gestures is a long …
human communication. The automatic generation of such co‐speech gestures is a long …
Human motion generation: A survey
Human motion generation aims to generate natural human pose sequences and shows
immense potential for real-world applications. Substantial progress has been made recently …
immense potential for real-world applications. Substantial progress has been made recently …
Generating human motion from textual descriptions with discrete representations
In this work, we investigate a simple and must-known conditional generative framework
based on Vector Quantised-Variational AutoEncoder (VQ-VAE) and Generative Pre-trained …
based on Vector Quantised-Variational AutoEncoder (VQ-VAE) and Generative Pre-trained …
Edge: Editable dance generation from music
Dance is an important human art form, but creating new dances can be difficult and time-
consuming. In this work, we introduce Editable Dance GEneration (EDGE), a state-of-the-art …
consuming. In this work, we introduce Editable Dance GEneration (EDGE), a state-of-the-art …
Listen, denoise, action! audio-driven motion synthesis with diffusion models
Diffusion models have experienced a surge of interest as highly expressive yet efficiently
trainable probabilistic models. We show that these models are an excellent fit for …
trainable probabilistic models. We show that these models are an excellent fit for …
Bailando: 3d dance generation by actor-critic gpt with choreographic memory
Driving 3D characters to dance following a piece of music is highly challenging due to the
spatial constraints applied to poses by choreography norms. In addition, the generated …
spatial constraints applied to poses by choreography norms. In addition, the generated …
Ai choreographer: Music conditioned 3d dance generation with aist++
We present AIST++, a new multi-modal dataset of 3D dance motion and music, along with
FACT, a Full-Attention Cross-modal Transformer network for generating 3D dance motion …
FACT, a Full-Attention Cross-modal Transformer network for generating 3D dance motion …
Rhythmic gesticulator: Rhythm-aware co-speech gesture synthesis with hierarchical neural embeddings
Automatic synthesis of realistic co-speech gestures is an increasingly important yet
challenging task in artificial embodied agent creation. Previous systems mainly focus on …
challenging task in artificial embodied agent creation. Previous systems mainly focus on …
Livelyspeaker: Towards semantic-aware co-speech gesture generation
Gestures are non-verbal but important behaviors accompanying people's speech. While
previous methods are able to generate speech rhythm-synchronized gestures, the semantic …
previous methods are able to generate speech rhythm-synchronized gestures, the semantic …
Ude: A unified driving engine for human motion generation
Generating controllable and editable human motion sequences is a key challenge in 3D
Avatar generation. It has been labor-intensive to generate and animate human motion for a …
Avatar generation. It has been labor-intensive to generate and animate human motion for a …