Gesture Generation
35 papers with code • 4 benchmarks • 6 datasets
Generation of gestures as a sequence of 3D poses.
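To make the task definition concrete, a gesture clip is commonly stored as a per-frame skeleton, i.e. an array of shape (frames, joints, 3). The sketch below is a minimal illustration under assumed sizes (120 frames, 25 joints, 3D joint positions); real systems often use joint rotations and dataset-specific skeletons, and `dummy_generator` is a hypothetical placeholder, not any published model.

```python
import numpy as np

# A gesture clip as a sequence of 3D poses: one (J, 3) skeleton per frame.
# Shapes are illustrative assumptions, not tied to any specific dataset.
T, J = 120, 25                 # 120 frames (~4 s at 30 fps), 25 joints
gesture = np.zeros((T, J, 3))  # (frames, joints, xyz)

def dummy_generator(audio_features: np.ndarray, n_joints: int = 25) -> np.ndarray:
    """Hypothetical stand-in for a speech-to-gesture model:
    emits one pose per input audio frame, all joints at the origin."""
    t = audio_features.shape[0]
    return np.zeros((t, n_joints, 3))

poses = dummy_generator(np.random.randn(120, 40))  # 120 frames of 40-dim features
assert poses.shape == (120, 25, 3)
```

Speech-driven systems differ mainly in what conditions this mapping (audio, text, speaker identity) and in how the output sequence is modeled, which is what the papers below explore.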
Most implemented papers
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
robosuite is a simulation framework for robot learning powered by the MuJoCo physics engine.
The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation
All synthetic motion, however, is found to be vastly less appropriate for the speech than the original motion-capture recordings.
Learning Individual Styles of Conversational Gesture
Specifically, we perform cross-modal translation from "in-the-wild" monologue speech of a single speaker to their hand and arm motion.
Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity
In this paper, we present an automatic gesture generation model that uses the multimodal context of speech text, audio, and speaker identity to reliably generate gestures.
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis
Achieving realistic, vivid, and human-like synthesized conversational gestures conditioned on multi-modal data is still an unsolved problem due to the lack of available datasets, models and standard evaluation metrics.
Generating Holistic 3D Human Motion from Speech
This work addresses the problem of generating 3D holistic body motions from human speech.
The GENEA Challenge 2023: A large-scale evaluation of gesture generation models in monadic and dyadic settings
The effect of the interlocutor is even more subtle, with submitted systems at best performing barely above chance.
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling
We propose EMAGE, a framework to generate full-body human gestures from audio and masked gestures, encompassing facial, local body, hands, and global movements.
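The masked-gesture idea can be illustrated with a generic masked-modeling sketch: hide a random subset of pose frames and ask a model to reconstruct them from audio plus the visible frames. This is a hedged toy illustration of frame masking in general, not the actual EMAGE architecture; the mask ratio and zero-fill are assumptions for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
T, J = 120, 25
poses = rng.standard_normal((T, J, 3))  # ground-truth gesture clip

# Mask a random subset of frames (generic masked-modeling sketch).
mask_ratio = 0.4
masked = rng.random(T) < mask_ratio  # boolean mask over frames

inputs = poses.copy()
inputs[masked] = 0.0  # masked frames zeroed; a model would reconstruct them

assert inputs.shape == poses.shape
assert np.all(inputs[masked] == 0.0)
```

A training objective would then compare the model's predictions against `poses` only at the masked positions, so the model learns to infer plausible motion from context and audio.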
Analyzing Input and Output Representations for Speech-Driven Gesture Generation
We evaluate different representation sizes in order to find the most effective dimensionality for the representation.
Gesticulator: A framework for semantically-aware speech-driven gesture generation
During speech, people spontaneously gesticulate, which plays a key role in conveying information.