arxiv:2305.18365
KehanGuo
kguo2
AI & ML interests
large language model, molecule representation learning
Models (10)
kguo2/Qwen2.5-3B-PRM-GRPO-SingleGPU
kguo2/finetune_demo • Text Generation • 8B • 10
kguo2/Qwen2.5-7B • 8B • 6
kguo2/Qwen2.5-3B • 3B • 6
kguo2/output_dir • Text Generation • 0.6B • 15
kguo2/Qwen2.5-3B-GRPO-test • Text Generation • 3B • 9
kguo2/Qwen2-0.5B-Instruct
kguo2/Qwen2.5-7B-GRPO-test
kguo2/Qwen2-0.5B-GRPO-test • 0.5B • 8
kguo2/llama-2-uspto
Datasets (6)
kguo2/prm_data • 2.29k • 8
kguo2/ift-scaffold-dataset • 28.5k • 9
kguo2/scaffold_finetune • 29k • 11
kguo2/smiles-dataset • 29.3k • 58
kguo2/Qwen2-0.5B-GRPO-test • 29.2k • 15
kguo2/MolPuzzle_data • 25k • 59