Uday Phalak
Uday
AI & ML interests
None yet
Recent Activity
liked a model about 4 hours ago
WeiboAI/VibeThinker-3B liked a dataset 2 days ago
uzaymacar/math-rollouts upvoted an article 16 days ago
Illustrating Reinforcement Learning from Human Feedback (RLHF)