AmanPriyanshu/tool-reasoning-sft-RESEARCH-rlvr-env-retrieval-source Viewer • Updated 8 days ago • 156k • 32
AmanPriyanshu/tool-reasoning-sft-RESEARCH-openresearcher-dataset-sft-deep-research-agent-data-cleaned Updated 9 days ago • 474 • 1
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenHands-CodeScout_Training_Rollouts Viewer • Updated 9 days ago • 56.8k • 27
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenSeeker-v1-Data Viewer • Updated 9 days ago • 7.19k • 16
AmanPriyanshu/tool-reasoning-sft-RESEARCH-REDSearcher_SFT_10K Viewer • Updated 9 days ago • 9.05k • 18
AmanPriyanshu/reasoning-sft-poor-quality-reasoning-sample-mix Viewer • Updated 17 days ago • 150k • 21
AmanPriyanshu/reasoning-sft-minimax-microsoft-orca-agentinstruct-1M-v1 Viewer • Updated 18 days ago • 945k • 72 • 1
AmanPriyanshu/reasoning-sft-minimax-stratified-kmeans-diverse-reasoning-842K-only Viewer • Updated 18 days ago • 843k • 41
AmanPriyanshu/tool-reasoning-sft-TOOLS-toucan-1.5m-sft-tool-use-data-cleaned-rectified-333k Viewer • Updated 19 days ago • 566k • 35
AmanPriyanshu/RLVR-Env-Retrieval-Source-Retrieval-Synthetic-NVDocs-v1 Viewer • Updated 19 days ago • 100k • 41
AmanPriyanshu/tool-reasoning-sft-CODING-nvidia-Nemotron-Agentic-v1 Viewer • Updated 20 days ago • 331k • 22
AmanPriyanshu/reasoning-sft-Nemotron-Instruction-Following-Chat-v1 Viewer • Updated 20 days ago • 158k • 33
AmanPriyanshu/tool-reasoning-sft-RESEARCH-grill-lab-browsecomp-plus-runs-data-cleaned-rectified Viewer • Updated 23 days ago • 49.9k • 67
AmanPriyanshu/reasoning-sft-Edge-Agent-Reasoning-WebSearch-260K Viewer • Updated 23 days ago • 262k • 33
AmanPriyanshu/tool-reasoning-sft-CODING-allenai-SERA-data-cleaned-rectified Viewer • Updated 23 days ago • 211k • 35
AmanPriyanshu/tool-reasoning-sft-TOOLS-hermes-reasoning-tool-style-data-cleaned-rectified-115k Viewer • Updated 23 days ago • 115k • 50
AmanPriyanshu/RLVR-Env-Retrieval-Source-code-search-net-python Viewer • Updated 24 days ago • 100k • 42
AmanPriyanshu/RLVR-Env-Retrieval-Source-code-search-net-javascript Viewer • Updated 24 days ago • 100k • 26
AmanPriyanshu/tool-reasoning-sft-CODING-CoVe-12k-data-cleaned-rectified Viewer • Updated 27 days ago • 12k • 35
AmanPriyanshu/tool-reasoning-sft-CODING-MEnvData-SWE-Trajectory-data-cleaned-rectified Viewer • Updated 27 days ago • 3.92k • 38 • 1