Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lblaoke
's Collections
Preference Data
Draft Models
Yifan's PPO Models
Yifan's RMs
Preference Data
updated
May 29
Upvote
-
Dahoas/full-hh-rlhf
Viewer
•
Updated
Feb 23, 2023
•
125k
•
1.7k
•
86
HuggingFaceH4/ultrafeedback_binarized
Viewer
•
Updated
Oct 16, 2024
•
187k
•
9.34k
•
316
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
Oct 18, 2024
•
164k
•
6.97k
•
166
Skywork/Skywork-Reward-Preference-80K-v0.2
Viewer
•
Updated
Oct 25, 2024
•
77k
•
730
•
62
nvidia/HelpSteer3
Viewer
•
Updated
Nov 16
•
133k
•
2.69k
•
93
allenai/reward-bench
Viewer
•
Updated
Sep 9, 2024
•
8.11k
•
5.48k
•
103
Upvote
-
Share collection
View history
Collection guide
Browse collections