plaguss (Agustín Piqueres Lajarín)

upvoted an article about 1 year ago

Article

Open R1: Update #3

open-r1

•

Mar 11, 2025

• 297

upvoted an article over 1 year ago

Article

Open R1: Update #2

open-r1

•

Feb 10, 2025

• 218

upvoted a paper over 1 year ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 258

upvoted 3 articles over 1 year ago

Article

FuseO1-Preview: System-II Reasoning Fusion of LLMs

Wanfq

•

Jan 20, 2025

• 22

Article

Open-R1: Update #1

open-r1

•

Feb 2, 2025

• 305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

upvoted a paper over 1 year ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 101

upvoted an article over 1 year ago

Article

Python Is All You Need? Introducing Dria-Agent-α

andthattoo

•

Jan 10, 2025

• 27

upvoted a collection over 1 year ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6, 2025 • 30

upvoted an article over 1 year ago

Article

Process Reinforcement through Implicit Rewards

ganqu

•

Jan 3, 2025

• 31

upvoted 3 papers over 1 year ago

upvoted a collection over 1 year ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 43

upvoted 2 articles over 1 year ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

davidberenstein1957

•

Nov 21, 2024

• 35

Article

Halo: Open Source Health Tracking with Wearables

cyrilzakka

•

Nov 19, 2024

• 118

upvoted a paper over 1 year ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 24

upvoted 3 articles over 1 year ago

Article

Releasing the largest multilingual open pretraining dataset

Pclanglais

•

Nov 13, 2024

• 107

Article

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

+5

bwillard, drbh, erikkaum, kc611, remi, umut-sahin, willkurt

•

Oct 22, 2024

• 44

Article

How to build a custom text classifier without days of human labeling

sdiazlor

•

Oct 17, 2024

• 57

Agustín Piqueres Lajarín

AI & ML interests

Organizations

Open R1: Update #3

Open R1: Update #2

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

FuseO1-Preview: System-II Reasoning Fusion of LLMs

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Python Is All You Need? Introducing Dria-Agent-α

Scaling Test-Time Compute with Open Models

Process Reinforcement through Implicit Rewards

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Free Process Rewards without Process Labels

Solving math word problems with process- and outcome-based feedback

SmolVLM

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Halo: Open Source Health Tracking with Wearables

Aligning Large Language Models via Self-Steering Optimization

Releasing the largest multilingual open pretraining dataset

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

How to build a custom text classifier without days of human labeling

Agustín Piqueres Lajarín

AI & ML interests

Organizations

plaguss's activity

Open R1: Update #3

Open R1: Update #2

FuseO1-Preview: System-II Reasoning Fusion of LLMs

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

Python Is All You Need? Introducing Dria-Agent-α

Process Reinforcement through Implicit Rewards

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Halo: Open Source Health Tracking with Wearables

Releasing the largest multilingual open pretraining dataset

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

How to build a custom text classifier without days of human labeling