microsoft/bitnet-b1.58-2B-4T
Text Generation
•
0.8B
•
Updated
•
7.37k
•
1.23k
Generate high-quality text data for LLMs using FineWeb
The ultimate guide to training LLM on large GPU Clusters
Calculate memory usage for model configurations