Running 3.76k The Ultra-Scale Playbook 🌌 3.76k The ultimate guide to training LLM on large GPU Clusters
Running Featured 127 Open-LLM performances are plateauing, let’s make the leaderboard steep again 🏔 127 Explore and compare advanced language models on a new leaderboard
Running on Zero 31 Gpt2 Multiplication Predictor 📈 31 Multiply large numbers using different reasoning methods
Running Featured 1.32k FineWeb: decanting the web for the finest text data at scale 🍷 1.32k Read a detailed overview of the FineWeb web‑scale text dataset