SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 258
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 +1 eliebak, lvwerra, lewtun • Jan 28, 2025 • 889
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13, 2025 • 101
Scaling Test-Time Compute with Open Models Collection Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6, 2025 • 30
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 87
Solving math word problems with process- and outcome-based feedback Paper • 2211.14275 • Published Nov 25, 2022 • 11
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 43
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK davidberenstein1957 • Nov 21, 2024 • 35
view article Article Halo: Open Source Health Tracking with Wearables cyrilzakka • Nov 19, 2024 • 118
Aligning Large Language Models via Self-Steering Optimization Paper • 2410.17131 • Published Oct 22, 2024 • 24
view article Article Releasing the largest multilingual open pretraining dataset Pclanglais • Nov 13, 2024 • 107
view article Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python +5 bwillard, drbh, erikkaum, kc611, remi, umut-sahin, willkurt • Oct 22, 2024 • 44
view article Article How to build a custom text classifier without days of human labeling sdiazlor • Oct 17, 2024 • 57