Zhouliang Yu

zhouliang

https://zhouliang-yu.github.io

zhouliang-yu

AI & ML interests

Model-Based AI, Reinforcement Learning, Autoformalization

Recent Activity

upvoted a paper 2 days ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published 6 days ago • 74

liked a dataset 3 days ago

BytedTsinghua-SIA/CUDA-Agent-Ops-6K

Viewer • Updated 6 days ago • 6k • 174 • 38

liked a dataset 8 days ago

Goedel-LM/SFT_dataset_v2

Viewer • Updated 3 days ago • 1.75M • 440 • 29

liked 3 datasets 10 days ago

liked a dataset 15 days ago

lm-provers/FineProofs-SFT

Viewer • Updated 20 days ago • 12.1k • 401 • 37

upvoted a paper 20 days ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published about 1 month ago • 11

liked a dataset 23 days ago

FrenzyMath/Herald_proofs

Viewer • Updated May 13, 2025 • 44.6k • 104 • 3

liked a dataset 25 days ago

INSAIT-Institute/OPC

Viewer • Updated Jul 15, 2025 • 4.93k • 103 • 14

liked a dataset 27 days ago

wenjiema02/ProofBench

Viewer • Updated Oct 14, 2025 • 899 • 117 • 7

upvoted a paper 28 days ago

Steering LLMs via Scalable Interactive Oversight

Paper • 2602.04210 • Published about 1 month ago • 18

upvoted an article 30 days ago

What's Automatic Differentiation?

Article • Mar 19, 2024

liked 2 datasets about 1 month ago

ulamai/UnsolvedMath

Updated 30 days ago • 132 • 23

phanerozoic/Lean4-Mathlib

Viewer • Updated Jan 10 • 193k • 107 • 2

liked a dataset 2 months ago

nvidia/Nemotron-Math-Proofs-v1

Viewer • Updated Jan 5 • 925k • 1.35k • 103

published a dataset 4 months ago

zhouliang/DEMIMathAnalysis

Viewer • Updated Feb 27, 2025 • 88 • 5

upvoted a paper 4 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

liked a model 5 months ago

nvidia/OpenMath-Nemotron-1.5B

Text Generation • 2B • Updated Apr 30, 2025 • 3.56k • 28

authored a paper 5 months ago

SimKO: Simple Pass@K Policy Optimization

Paper • 2510.14807 • Published Oct 16, 2025 • 11
