arxiv:2410.01180
Kangda Wei
kangdawei
AI & ML interests
None yet
Organizations
models 50
kangdawei/MMR-Sigmoid-DAPO-7B
Text Generation • 8B • Updated • 412
kangdawei/MMR-Sigmoid-DR-GRPO-8B
Text Generation • 8B • Updated
kangdawei/MMR-Sigmoid-DAPO-8B
Text Generation • 8B • Updated • 6
kangdawei/MMR-Sigmoid-DAPO
Text Generation • 2B • Updated • 3
kangdawei/MMR-Sigmoid-GRPO-8B
Text Generation • 8B • Updated • 1
kangdawei/MMR-Sigmoid-GRPO-7B
Text Generation • 8B • Updated • 3
kangdawei/MMR-Sigmoid-DR-GRPO-7B
Text Generation • 8B • Updated
kangdawei/DAPO-8B
Text Generation • 8B • Updated • 2
kangdawei/DAPO-7B
Text Generation • 8B • Updated • 2 • 1
kangdawei/MMR-DAPO-8B
Text Generation • 8B • Updated • 4 • 1
datasets 0
None public yet