R1-V Towards the Aha Moment of Vision-Language Models MMInstruction/Clevr_CoGenT_TrainA_R1 Viewer • Updated Feb 13, 2025 • 37.8k • 205 • 48 MMInstruction/SuperClevr_Val Viewer • Updated Feb 18, 2025 • 5k • 144 • 1 MMInstruction/Clevr_CoGenT_ValA Viewer • Updated Feb 3, 2025 • 5k • 267 • 1 MMInstruction/Clevr_CoGenT_ValB Viewer • Updated Feb 3, 2025 • 5k • 20 • 2
R1-V Towards the Aha Moment of Vision-Language Models MMInstruction/Clevr_CoGenT_TrainA_R1 Viewer • Updated Feb 13, 2025 • 37.8k • 205 • 48 MMInstruction/SuperClevr_Val Viewer • Updated Feb 18, 2025 • 5k • 144 • 1 MMInstruction/Clevr_CoGenT_ValA Viewer • Updated Feb 3, 2025 • 5k • 267 • 1 MMInstruction/Clevr_CoGenT_ValB Viewer • Updated Feb 3, 2025 • 5k • 20 • 2