Running 3.62k The Ultra-Scale Playbook π 3.62k The ultimate guide to training LLM on large GPU Clusters
Qwen/Qwen2.5-Coder-32B-Instruct Text Generation β’ 33B β’ Updated Jan 12, 2025 β’ 213k β’ β’ 1.96k
mattshumer/Reflection-Llama-3.1-70B Text Generation β’ 71B β’ Updated Sep 24, 2024 β’ 359 β’ 1.71k
MaziyarPanahi/MixTAO-7Bx2-MoE-Instruct-v7.0-GGUF Text Generation β’ 13B β’ Updated Feb 4, 2024 β’ 131 β’ 9