zijie tian
zijie-tian
AI & ML interests
Storage for AI
Recent Activity
upvoted
a
paper
about 22 hours ago
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading
upvoted
a
paper
3 days ago
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference
upvoted
an
article
13 days ago
MInference 1.0: 10x Faster Million Context Inference with a Single GPU