zijie tian

zijie-tian

https://zijie-tian.github.io

Zijie-Tian

AI & ML interests

Storage for AI

Recent Activity

upvoted a paper about 22 hours ago

HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

upvoted a paper 3 days ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

upvoted an article 13 days ago

MInference 1.0: 10x Faster Million Context Inference with a Single GPU

Organizations

zijie-tian's datasets

None public yet