A Comprehensive Review of Data‐Driven Co‐Speech Gesture Generation

S Nyatsanga, T Kucherenko, C Ahuja… - Computer Graphics …, 2023 - Wiley Online Library
Gestures that accompany speech are an essential part of natural and efficient embodied
human communication. The automatic generation of such co‐speech gestures is a long …

Human motion generation: A survey

W Zhu, X Ma, D Ro, H Ci, J Zhang, J Shi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Human motion generation aims to generate natural human pose sequences and shows
immense potential for real-world applications. Substantial progress has been made recently …

Generating human motion from textual descriptions with discrete representations

J Zhang, Y Zhang, X Cun, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work, we investigate a simple and must-known conditional generative framework
based on Vector Quantised-Variational AutoEncoder (VQ-VAE) and Generative Pre-trained …

Edge: Editable dance generation from music

J Tseng, R Castellon, K Liu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Dance is an important human art form, but creating new dances can be difficult and time-
consuming. In this work, we introduce Editable Dance GEneration (EDGE), a state-of-the-art …

Listen, denoise, action! audio-driven motion synthesis with diffusion models

S Alexanderson, R Nagy, J Beskow… - ACM Transactions on …, 2023 - dl.acm.org
Diffusion models have experienced a surge of interest as highly expressive yet efficiently
trainable probabilistic models. We show that these models are an excellent fit for …

Bailando: 3d dance generation by actor-critic gpt with choreographic memory

L Siyao, W Yu, T Gu, C Lin, Q Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Driving 3D characters to dance following a piece of music is highly challenging due to the
spatial constraints applied to poses by choreography norms. In addition, the generated …

Ai choreographer: Music conditioned 3d dance generation with aist++

R Li, S Yang, DA Ross… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We present AIST++, a new multi-modal dataset of 3D dance motion and music, along with
FACT, a Full-Attention Cross-modal Transformer network for generating 3D dance motion …

Rhythmic gesticulator: Rhythm-aware co-speech gesture synthesis with hierarchical neural embeddings

T Ao, Q Gao, Y Lou, B Chen, L Liu - ACM Transactions on Graphics …, 2022 - dl.acm.org
Automatic synthesis of realistic co-speech gestures is an increasingly important yet
challenging task in artificial embodied agent creation. Previous systems mainly focus on …

Livelyspeaker: Towards semantic-aware co-speech gesture generation

Y Zhi, X Cun, X Chen, X Shen, W Guo… - Proceedings of the …, 2023 - openaccess.thecvf.com
Gestures are non-verbal but important behaviors accompanying people's speech. While
previous methods are able to generate speech rhythm-synchronized gestures, the semantic …

Ude: A unified driving engine for human motion generation

Z Zhou, B Wang - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Generating controllable and editable human motion sequences is a key challenge in 3D
Avatar generation. It has been labor-intensive to generate and animate human motion for a …