SparDA: Sparse Decoupled Attention for Efficient Long-Context LLM Inference • Paper • 2606.04511 • Published Jun 3 • 3
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding • Paper • 2605.27365 • Published May 26 • 145