From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping

Xu He1,* Haoxian Zhang2,† Hejia Chen3 Changyuan Zheng1 Liyang Chen1
Songlin Tang2 Jiehui Huang4 Xiaoqiang Liu2 Pengfei Wan2 Zhiyong Wu1,5,✉

1Tsinghua University    2Kling Team, Kuaishou Technology    3Beihang University    4HKUST    5CUHK
*Work done at Kling Team, Kuaishou Technology    †Project leader    ✉Corresponding author

   

Please refer to the GitHub README for usage.

📌 TL;DR

X-Dub is a visual dubbing system that synchronizes a character's lip movements in a video to match arbitrary input audio. This repository hosts the public Wan-based X-Dub release and its pretrained weights.

🌟 Citation

Please cite our paper if you find our work helpful.

@article{he2025from,
  title={From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing},
  author={He, Xu and Zhang, Haoxian and Chen, Hejia and Zheng, Changyuan and Chen, Liyang and Tang, Songlin and Huang, Jiehui and Liu, Xiaoqiang and Wan, Pengfei and Wu, Zhiyong},
  journal={arXiv preprint arXiv:2512.25066},
  year={2025}
}