Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation
Paper • 2605.26844 • Published • 17
E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring