EvoClaw: Evaluating AI Agents on Continuous Software Evolution Paper • 2603.13428 • Published 13 days ago • 20
Rethinking the Harmonic Loss via Non-Euclidean Distance Layers Paper • 2603.10225 • Published 15 days ago
view post Post 183 Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and the Deepspeed teams has been integrated into HuggingFace Trainer, Accelerate and TRLFor extensive details please see this writeup:https://huggingface.co/blog/ulysses-spThanks a lot to Kashif Rasul for helping make it happen. Also the others in the HF team who helped with integration. See translation 🤗 1 1 + Reply
Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking Paper • 2602.21196 • Published 29 days ago • 5
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL Paper • 2602.03773 • Published Feb 3 • 12
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning Paper • 2602.07075 • Published Feb 6 • 18
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences Paper • 2506.13996 • Published Jun 16, 2025 • 1
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107