DCAgent2/glm4_7-fixthink-codecontestStep141
8B • Updated
• 8
DCAgent2/codecontest-8B-overlongFilter-DrGRPO-step150
8B • Updated
• 10
DCAgent2/bs64_rloo_n_noct_stri_micr_auto_conv_pref_model_r2e-120
8B • Updated
• 42
DCAgent2/nl2bashGPT5CodexPassed-qwen3-8b-8nodes-sync-logtest
Updated
DCAgent2/swesmith-stack-over5050
Text Generation
• 308k • Updated
• 2
DCAgent2/swesmith-nl2bashseq
308k • Updated
• 3
DCAgent2/stack-swesmithseq
Text Generation
• 308k • Updated
• 16
DCAgent2/stack-bugs-undr3070
Text Generation
• 308k • Updated
• 6
DCAgent2/nl2bash-swesmithseq
308k • Updated
• 2
DCAgent2/nl2bash-swesmith-undr7030
308k • Updated
• 188
DCAgent2/nl2bash-swesmith-reason
Text Generation
• 308k • Updated
• 49
DCAgent2/nl2bash-swesmith-over5050
Text Generation
• 308k • Updated
• 7
DCAgent2/nl2bash-stack-bugs-undr503020
Text Generation
• 308k • Updated
• 15
DCAgent2/nl2bash-stack-bugs-undr203050
Updated
DCAgent2/bugs-swesmith-undr7030
308k • Updated
• 1
DCAgent2/bugs-swesmith-over5050
308k • Updated
• 13
DCAgent2/nl2bash-stack-bugs-over333
308k • Updated
• 15
DCAgent2/swesmith-bugsseq
Text Generation
• 308k • Updated
• 45
DCAgent2/bugs-swesmithseq
Text Generation
• 308k • Updated
• 1
DCAgent2/swesmith-stack-reason
Text Generation
• 308k • Updated
• 49
DCAgent2/bugs-swesmith-reason
Text Generation
• 308k • Updated
• 3
DCAgent2/swesmith-stackseq
Text Generation
• 308k • Updated
• 12
DCAgent2/swesmith-stack-undr7030
Text Generation
• 308k • Updated
• 1
DCAgent2/stack-bugs-undr7030
Text Generation
• 308k • Updated
• 52
Text Generation
• 308k • Updated
• 50
DCAgent2/stack-bugs-over5050
Text Generation
• 308k • Updated
• 14
DCAgent2/nl2bash-stack-bugsseq
Text Generation
• 308k • Updated
• 118
DCAgent2/stack-bugsshuffle
Text Generation
• 308k • Updated
• 20
DCAgent2/bugs-stack-nl2bashseq
Text Generation
• 308k • Updated
• 20
DCAgent2/nl2bash-stack-bugsshuffle
Text Generation
• 308k • Updated
• 5