RecTok: Reconstruction Distillation along Rectified Flow Paper • 2512.13421 • Published 12 days ago • 4
Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation Paper • 2512.02457 • Published 25 days ago • 13
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model Paper • 2505.23606 • Published May 29 • 14
On Path to Multimodal Generalist: General-Level and General-Bench Paper • 2505.04620 • Published May 7 • 82
RelationBooth: Towards Relation-Aware Customized Object Generation Paper • 2410.23280 • Published Oct 30, 2024 • 1
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published Apr 8 • 64
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer Paper • 2503.17350 • Published Mar 21 • 1