view article Article Beyond LoRA: Can you beat the most popular fine-tuning technique? +2 BenjaminB, sayakpaul, hubnemo, kashif • 14 days ago • 70
view article Article Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP +3 ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • 21 days ago • 50
view article Article The Open Source Community is backing OpenEnv for Agentic RL +17 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego, banghua • 24 days ago • 102
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • May 29 • 131
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • May 14 • 61
Running RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 50
view article Article Announcing the Hugging Face Fellowship Program merve, espejelomar • May 17, 2022 • 16
Running RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation
Running RL 1 Price Negotiation Environment Server 🎭 1 Negotiate prices in an interactive buyer simulation