LaViDa-R1: Advancing Reasoning for Unified Multimodal Diffusion Language Models Paper • 2602.14147 • Published Feb 15 • 6
ViT-AdaLA: Adapting Vision Transformers with Linear Attention Paper • 2603.16063 • Published 13 days ago • 2
SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generation Paper • 2603.15150 • Published 13 days ago
Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation Paper • 2509.19244 • Published Sep 23, 2025 • 12