Yihe Deng PRO
ydeng9
AI & ML interests: LLM post-training
Recent Activity
updated a dataset 9 days ago: ydeng9/OpenVLThinker-grpo-hard
updated a dataset 9 days ago: ydeng9/OpenVLThinker-grpo-medium
published a dataset 3 months ago: ydeng9/swe-smith-rl-distill