TWIN Datasets and models from the paper "Same or Not? Enhancing Visual Perception in Vision-Language Models" glab-caltech/TWIN-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated 1 day ago • 32 glab-caltech/TWIN-InternVL3_5-1B Image-Text-to-Text • 1B • Updated 1 day ago • 5 • 1 glab-caltech/FGVQA Viewer • Updated 1 day ago • 12k • 27 glab-caltech/TWIN Viewer • Updated 1 day ago • 562k • 38 • 3
VALOR Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" glab-caltech/VALOR-8B 8B • Updated 20 days ago • 61 glab-caltech/VALOR-GroundingDINO Object Detection • Updated 20 days ago
TWIN Datasets and models from the paper "Same or Not? Enhancing Visual Perception in Vision-Language Models" glab-caltech/TWIN-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated 1 day ago • 32 glab-caltech/TWIN-InternVL3_5-1B Image-Text-to-Text • 1B • Updated 1 day ago • 5 • 1 glab-caltech/FGVQA Viewer • Updated 1 day ago • 12k • 27 glab-caltech/TWIN Viewer • Updated 1 day ago • 562k • 38 • 3
VALOR Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" glab-caltech/VALOR-8B 8B • Updated 20 days ago • 61 glab-caltech/VALOR-GroundingDINO Object Detection • Updated 20 days ago