view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 11 days ago • 234
view article Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2 +4 Aug 21, 2024 • 42