Bolian Li
lblaoke
AI & ML interests
None yet
Organizations
None yet
models 44
lblaoke/opt-350m-hh-rlhf-rm-trl-v5
0.3B • Updated
lblaoke/opt-350m-hh-rlhf-dpo-trl-v5
0.3B • Updated
• 1
lblaoke/opt-350m-hh-rlhf-chosen-sft-trl-v5
0.3B • Updated
lblaoke/opt-125m-hh-rlhf-rm-trl-v5
0.1B • Updated
lblaoke/opt-125m-hh-rlhf-dpo-trl-v5
0.1B • Updated
lblaoke/opt-125m-hh-rlhf-chosen-sft-trl-v5
0.1B • Updated
• 1
lblaoke/qwama-0.5b-hh-rlhf-sft-chosen-trl-v4
0.5B • Updated
• 1
lblaoke/qwama-0.5b-skywork-pref-sft-chosen-dpo-trl-v3
0.5B • Updated
• 4
lblaoke/qwama-0.5b-skywork-pref-sft-rejected-chosen-trl-v3
0.5B • Updated
lblaoke/qwama-0.5b-skywork-pref-sft-chosen-trl-v3
0.5B • Updated
• 1
datasets 0
None public yet