UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving Paper โข 2512.09864 โข Published 22 days ago โข 10
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback Paper โข 2505.17908 โข Published May 23, 2025 โข 3
Long-Video Audio Synthesis with Multi-Agent Collaboration Paper โข 2503.10719 โข Published Mar 13, 2025 โข 9
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation Paper โข 2503.01370 โข Published Mar 3, 2025 โข 15
TransPixar: Advancing Text-to-Video Generation with Transparency Paper โข 2501.03006 โข Published Jan 6, 2025 โข 25
GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs Paper โข 2412.11258 โข Published Dec 15, 2024 โข 13
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper โข 2411.13503 โข Published Nov 20, 2024 โข 34
SEED-Story: Multimodal Long Story Generation with Large Language Model Paper โข 2407.08683 โข Published Jul 11, 2024 โข 24
LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching Paper โข 2311.11284 โข Published Nov 19, 2023 โข 20