He Xiao's picture

6

He Xiao

River555

·

https://hxriver.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models

authored a paper 5 days ago

PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models

authored a paper 5 days ago

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

View all activity

Organizations

None yet

authored 4 papers 5 days ago

Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models

Paper • 2508.03332 • Published Aug 5, 2025

PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models

Paper • 2509.16989 • Published Sep 21, 2025 • 1

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Paper • 2505.23932 • Published May 29, 2025

BPDQ: Bit-Plane Decomposition Quantization on a Variable Grid for Large Language Models

Paper • 2602.04163 • Published Feb 4 • 10

upvoted a paper 6 days ago

PTQTP: Post-Training Quantization to Trit-Planes for Large Language Models

Paper • 2509.16989 • Published Sep 21, 2025 • 1

upvoted a paper 2 months ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 47

upvoted a collection 2 months ago

PTQTP

Rebuttal for NIPS25 • 32 items • Updated Jul 29, 2025 • 1

upvoted a paper 2 months ago

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 200

upvoted a paper 3 months ago

SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving

Paper • 2601.01426 • Published Jan 4 • 24

upvoted a paper 10 months ago

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

Paper • 2505.15929 • Published May 21, 2025 • 49