Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 152
TIL: How a Harmless Refactor Exposed a Hidden CUDA Bug in Vision-Language Models • albertvillanova • Oct 22, 2025
TinyAgents: A Minimal Experiment with Code Agents and MCP Tools • albertvillanova • May 16, 2025 • 30
Open-source DeepResearch – Freeing our search agents • m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
We now support VLMs in smolagents! • m-ric, merve, albertvillanova • Jan 24, 2025 • 113
CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard • alozowski, SaylorTwift, albertvillanova, clefourrier • Jan 9, 2025 • 21