Abliterated models performance
treadon/granite-4.1-3b-Abliterated-AND-Disinhibited
quant arc arc/e boolq hswag obkqa piqa wino
mxfp8 0.405,0.598,0.843,0.520,0.442,0.713,0.582
Quant Perplexity Peak Memory Tokens/sec
mxfp8 11.595 ± 0.129 6.58 GB 1594
granite-4.1-3b
mxfp8 0.406,0.581,0.821,0.484,0.434,0.712,0.559
Quant Perplexity Peak Memory Tokens/sec
mxfp8 11.346 ± 0.127 6.58 GB 1690
treadon/granite-4.1-8b-Abliterated-AND-Disinhibited
mxfp8 0.496,0.692,0.864,0.666,0.466,0.770,0.632
Quant Perplexity Peak Memory Tokens/sec
mxfp8 9.518 ± 0.094 11.75 GB 686
granite-4.1-8b
mxfp8 0.486,0.666,0.875,0.636,0.450,0.766,0.631
Quant Perplexity Peak Memory Tokens/sec
mxfp8 10.134 ± 0.107 12.17 GB 668