AI & ML interests
IR, NLP
Models (13)
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-easy_query-100k (Text Generation)
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-hard_query-100k (Text Generation)
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-500easy_500hard_query (Text Generation)
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-hard_query (Text Generation)
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-easy_query (Text Generation)
jnian/Qwen2.5-0.5B-Instruct-Open-R1-GRPO
jnian/Qwen2.5-3B-Instruct-Open-R1-GRPO
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO
jnian/Qwen2.5-1.5B-Open-R1-GRPO
jnian/Qwen2.5-3B-Open-R1-GRPO