Thanks, @Jackmin108. Do you mind opening a PR to update the context with references via: https://github.com/huggingface/blog/blob/main/async-rl-training-landscape.md
Kashif Rasul
kashif