ithinkimrishi
's Collections
To read
updated
HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video
Paper
•
2510.05560
•
Published
•
7
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular
Reasoning
Paper
•
2510.06217
•
Published
•
63
Less is More: Recursive Reasoning with Tiny Networks
Paper
•
2510.04871
•
Published
•
501
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper
•
2509.26328
•
Published
•
55
CoDA: Coding LM via Diffusion Adaptation
Paper
•
2510.03270
•
Published
•
42
MemMamba: Rethinking Memory Patterns in State Space Model
Paper
•
2510.03279
•
Published
•
72
33B
•
Updated
•
66.1k
•
255
Thinking with Camera: A Unified Multimodal Model for Camera-Centric
Understanding and Generation
Paper
•
2510.08673
•
Published
•
125
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to
Embodied AI
Paper
•
2510.05684
•
Published
•
141
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video
Narratives
Paper
•
2510.20822
•
Published
•
40
DeepAgent: A General Reasoning Agent with Scalable Toolsets
Paper
•
2510.21618
•
Published
•
99
Video-As-Prompt: Unified Semantic Control for Video Generation
Paper
•
2510.20888
•
Published
•
45
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image
Generation
Paper
•
2510.21583
•
Published
•
30
WorldGrow: Generating Infinite 3D World
Paper
•
2510.21682
•
Published
•
42
Paper
•
2510.18212
•
Published
•
34
Visual Diffusion Models are Geometric Solvers
Paper
•
2510.21697
•
Published
•
19
RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via
Hierarchical Model Merging
Paper
•
2510.20479
•
Published
•
11