k-l-lambda/Kimi-K2-Instruct-FP4-MTP

k-l-lambda committed on Jul 24

Commit d388331

1 Parent(s): 4536b41

Updated hf_quant_config.json to accommodate the FP4 MLP weights in layer 61.

Files changed (1)

hf_quant_config.json CHANGED

@@ -70,7 +70,13 @@
             "model.layers.7.self_attn*",
             "model.layers.8.self_attn*",
             "model.layers.9.self_attn*",
-            "model.layers.61*"
+            "model.layers.61.self_attn*",
+            "model.layers.61.eh_proj*",
+            "model.layers.61.embed_tokens*",
+            "model.layers.61.enorm*",
+            "model.layers.61.hnorm*",
+            "model.layers.61.input_layernorm*",
+            "model.layers.61.shared_head*"
         ]
     }
 }
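
The effect of this change can be sketched in Python. This is only an illustration under two assumptions that the diff itself does not confirm: that the edited list is an exclusion list (modules matching a pattern are left unquantized), and that patterns are matched as shell-style globs. The module names `model.layers.61.mlp.experts.0.down_proj` and `model.layers.61.self_attn.q_proj` are hypothetical and serve only to exercise the patterns.

```python
from fnmatch import fnmatchcase

# Pattern list before the commit: one wildcard covering all of layer 61.
OLD_PATTERNS = ["model.layers.61*"]

# Pattern list after the commit: only the named submodules of layer 61.
NEW_PATTERNS = [
    "model.layers.61.self_attn*",
    "model.layers.61.eh_proj*",
    "model.layers.61.embed_tokens*",
    "model.layers.61.enorm*",
    "model.layers.61.hnorm*",
    "model.layers.61.input_layernorm*",
    "model.layers.61.shared_head*",
]


def is_excluded(module_name: str, patterns: list[str]) -> bool:
    """Return True if any glob pattern matches the module name.

    Assumption: the loader treats the list as exclusions and uses
    glob-style matching; the real loader may differ.
    """
    return any(fnmatchcase(module_name, p) for p in patterns)


# Hypothetical module names, chosen only for illustration.
mlp_module = "model.layers.61.mlp.experts.0.down_proj"
attn_module = "model.layers.61.self_attn.q_proj"

for name in (mlp_module, attn_module):
    print(name,
          "| old:", is_excluded(name, OLD_PATTERNS),
          "| new:", is_excluded(name, NEW_PATTERNS))
```

Under these assumptions, the old wildcard matches every module in layer 61, including the MLP, while the new list no longer matches MLP modules, which is consistent with the commit message about FP4 MLP weights in that layer.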