Preference Data - a lblaoke Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

lblaoke 's Collections

Preference Data

Yifan's PPO Models

Preference Data

updated May 29

Dahoas/full-hh-rlhf

Viewer • Updated Feb 23, 2023 • 125k • 1.7k • 86
HuggingFaceH4/ultrafeedback_binarized

Viewer • Updated Oct 16, 2024 • 187k • 9.34k • 316
PKU-Alignment/PKU-SafeRLHF

Viewer • Updated Oct 18, 2024 • 164k • 6.97k • 166
Skywork/Skywork-Reward-Preference-80K-v0.2

Viewer • Updated Oct 25, 2024 • 77k • 730 • 62
nvidia/HelpSteer3

Viewer • Updated Nov 16 • 133k • 2.69k • 93
allenai/reward-bench

Viewer • Updated Sep 9, 2024 • 8.11k • 5.48k • 103

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs