RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 23 days ago • 29.9k • 9
inference-optimization/Llama-3.1-8B-Instruct_5_bits_mode_heuristic 6B • Updated about 1 month ago • 10
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_hybrid 6B • Updated about 1 month ago • 20
inference-optimization/Llama-3.1-8B-Instruct_5.5_bits_mode_heuristic 6B • Updated about 1 month ago • 10
inference-optimization/Llama-3.1-8B-Instruct_6_bits_mode_heuristic 6B • Updated about 1 month ago • 15
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_hybrid 7B • Updated about 1 month ago • 14
inference-optimization/Llama-3.1-8B-Instruct_6.5_bits_mode_heuristic 7B • Updated about 1 month ago • 27
inference-optimization/Llama-3.1-8B-Instruct_7_bits_mode_heuristic 7B • Updated about 1 month ago • 11
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_hybrid 20B • Updated 15 days ago • 40
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_noise 20B • Updated 14 days ago • 35
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_heuristic 20B • Updated 14 days ago • 39
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_hybrid 22B • Updated 14 days ago • 36
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_noise 22B • Updated 14 days ago • 36
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_heuristic 22B • Updated 14 days ago • 34
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_hybrid 23B • Updated 14 days ago • 36
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_noise 23B • Updated 14 days ago • 32
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_heuristic 23B • Updated 14 days ago • 41
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_hybrid 25B • Updated 14 days ago • 38
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_noise 25B • Updated 14 days ago • 34
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_heuristic 25B • Updated 14 days ago • 39
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_hybrid 26B • Updated 14 days ago • 43
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_noise 26B • Updated 14 days ago • 39
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_heuristic 27B • Updated 14 days ago • 40