The Ultra-Scale Playbook 🌌 Running • 3.62k • The ultimate guide to training LLM on large GPU Clusters
apolo-mind/engineer-heavy-500k-barc-llama3.1-8b-ins-fft-transduction_lr1e-5_epoch3 Text Generation • 8B • Updated Mar 18, 2025 • 17
apolo-mind/l3.1-8b-inst-fft-induction-barc-heavy-200k-old-200k-lr1e-5-ep3 Text Generation • 8B • Updated Mar 15, 2025 • 6
Large Language Models Can Self-Improve At Web Agent Tasks Paper • 2405.20309 • Published May 30, 2024 • 2