Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Train AI models with Unsloth and Hugging Face Jobs for FREE

upvoted an article 2 months ago

We Got Claude to Build CUDA Kernels and teach open models!

upvoted an article 3 months ago

Deriving the PPO Loss from First Principles

Organizations

None yet

liked a model 8 months ago

Menlo/Lucy-128k

Text Generation • 2B • Updated Aug 4, 2025 • 253 • 109

liked a model 9 months ago

chandar-lab/NeoBERT

Feature Extraction • 0.2B • Updated Mar 25, 2025 • 15k • 194

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.76k

The ultimate guide to training LLM on large GPU Clusters

liked 2 models over 1 year ago

Datou1111/shou_xin

Text-to-Image • Updated Mar 16, 2025 • 172 • 875

lamm-mit/LifeGPT

Updated Sep 19, 2024 • 9

liked a Space over 1 year ago

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

127

Explore and compare advanced language models on a new leaderboard

liked a model over 1 year ago

nisten/Biggie-SmoLlm-0.15B-Base

Text Generation • 0.2B • Updated Aug 7, 2024 • 1.73k • 241

liked a Space over 1 year ago

Gpt2 Multiplication Predictor

📈

Multiply large numbers using different reasoning methods

liked 2 Spaces almost 2 years ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.32k

Read a detailed overview of the FineWeb web‑scale text dataset

Phi-3 WebGPU

🚀

294

A private and powerful AI that runs locally in your browser

liked a model almost 2 years ago

rombodawg/test_dataset_Codellama-3-8B

Text Generation • Updated May 4, 2024 • 6 • 78