RLinf/RLinf-OpenVLAOFT-RoboTwin-SFT-place_empty_cup
8B
•
Updated
None defined yet.
$π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training