papers
updated
Visual Representation Alignment for Multimodal Large Language Models
Paper
•
2509.07979
•
Published
•
83
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper
•
2509.07980
•
Published
•
101
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth
Paper
•
2509.03867
•
Published
•
210
Why Language Models Hallucinate
Paper
•
2509.04664
•
Published
•
195
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Paper
•
2508.21113
•
Published
•
110
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
Paper
•
2509.00676
•
Published
•
84
Towards a Unified View of Large Language Model Post-Training
Paper
•
2509.04419
•
Published
•
75
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic
Paper
•
2509.01363
•
Published
•
58
Does DINOv3 Set a New Medical Vision Standard?
Paper
•
2509.06467
•
Published
•
37
Reinforced Visual Perception with Tools
Paper
•
2509.01656
•
Published
•
31