DCAgent/rl__24GPU_shaped_entropy__mix_v2_baseline_uniform__qwen3base-GLM-4_7-sw Updated about 1 hour ago
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h4_dense_rewards_hard__qwen3base-GLM-4_7-sw Updated about 4 hours ago
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h2_language_proportional__qwen3base-GLM-4_7-sw Updated about 4 hours ago
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h2_language_balanced__qwen3base-GLM-4_7-sw Updated about 5 hours ago
DCAgent/rl__24GPU_shaped_entropy__mix_v2_h1_struggle_zone__qwen3base-GLM-4_7-sw Updated about 5 hours ago