CharacterShot: Controllable and Consistent 4D Character Animation

CharacterShot is a controllable and consistent 4D character animation framework that enables any individual designer to create dynamic 3D characters from a single reference character image and a 2D pose sequence.

Introduction

CharacterShot begins by pretraining a powerful 2D character animation model based on a DiT-based image-to-video model (CogVideoX). It lifts the animation model from 2D to 3D through introducing dual-attention module together with camera prior to generate multi-view videos with spatial-temporal and spatial-view consistency. Finally, it employs a novel neighbor-constrained 4D gaussian splatting optimization on these multi-view videos, resulting in continuous and stable 4D character representations.

Citation

@article{gao2025charactershot,
  title={CharacterShot: Controllable and Consistent 4D Character Animation},
  author={Gao, Junyao and Li, Jiaxing and Liu, Wenran and Zeng, Yanhong and Shen, Fei and Chen, Kai and Sun, Yanan and Zhao, Cairong},
  journal={arXiv preprint arXiv:2508.07409},
  year={2025},
}

Acknowledgements

The code is built upon CogVideo.

Downloads last month
9
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support