Yanheng He

henryhe0123

https://henryhe0123.github.io

Recent Activity

upvoted a paper 1 day ago

Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training

Paper • 2602.07824 • Published 11 days ago • 14

upvoted a paper 13 days ago

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Paper • 2602.02619 • Published 16 days ago • 50

upvoted a paper 22 days ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published 23 days ago • 124

upvoted a paper 27 days ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published 27 days ago • 52

upvoted a paper 30 days ago

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Paper • 2601.11044 • Published Jan 16 • 34

upvoted a paper about 2 months ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 65

upvoted 2 papers 4 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 46

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 166

upvoted a paper 5 months ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 104

upvoted a paper 7 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22, 2025 • 63

upvoted a paper 8 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 47

upvoted a paper 9 months ago

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20, 2025 • 44

upvoted a paper 10 months ago

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Paper • 2504.13828 • Published Apr 18, 2025 • 18

upvoted a paper about 1 year ago

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Paper • 2412.17589 • Published Dec 23, 2024 • 14

upvoted a collection about 1 year ago

Qwen2.5-VL

Collection • Vision-language model series based on Qwen2.5 • 11 items • Updated Dec 31, 2025 • 557

upvoted a paper over 1 year ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64

upvoted 2 articles over 1 year ago

Fast, High-Fidelity LLM Decoding with Regex Constraints

Article • Feb 23, 2024

Mixture of Experts Explained

Article • Dec 11, 2023 • 1.07k