BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published 15 days ago • 51
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models Paper • 2602.10934 • Published 19 days ago • 49
RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs Paper • 2602.05367 • Published 25 days ago • 7
Community Evals: Because we're done trusting black-box leaderboards over the community Article • 27 days ago • 83
Horizon-LM: A RAM-Centric Architecture for LLM Training Paper • 2602.04816 • Published 26 days ago • 17
No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs Paper • 2602.02103 • Published 28 days ago • 72
MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published 28 days ago • 65
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published 29 days ago • 41
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published 28 days ago • 16
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss Paper • 2602.02493 • Published 28 days ago • 43
FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space Paper • 2602.02092 • Published 28 days ago • 18
Closing the Loop: Universal Repository Representation with RPG-Encoder Paper • 2602.02084 • Published 28 days ago • 82
KAPSO: A Knowledge-grounded framework for Autonomous Program Synthesis and Optimization Paper • 2601.21526 • Published Jan 29 • 2
FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation Paper • 2601.23182 • Published about 1 month ago • 20
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published Jan 30 • 107
DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment Paper • 2601.20218 • Published Jan 28 • 15