Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models Paper • 2508.03332 • Published Aug 5, 2025
PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models Paper • 2509.16989 • Published Sep 21, 2025 • 1
SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving Paper • 2505.23932 • Published May 29, 2025
BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models Paper • 2602.04163 • Published Feb 4 • 10
PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models Paper • 2509.16989 • Published Sep 21, 2025 • 1
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models Paper • 2601.14004 • Published Jan 20 • 47
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published Jan 4 • 24
PhyX: Does Your Model Have the "Wits" for Physical Reasoning? Paper • 2505.15929 • Published May 21, 2025 • 49