Hugging Face
Yixin Liu
PRO
henryL7
2 followers · 3 following
Recent Activity
Submitted a paper 2 days ago: Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training
Authored a paper 23 days ago: References Improve LLM Alignment in Non-Verifiable Domains
Submitted a paper 23 days ago: References Improve LLM Alignment in Non-Verifiable Domains
henryL7's models (34)
henryL7/qwen2.5-7b-instruct-tulu-gen-dpo-offline-0.5 • Text Generation • 8B • Updated May 13, 2025
henryL7/qwen2.5-7b-instruct-tulu-gen-dpo-offline-0.2 • Text Generation • 8B • Updated May 13, 2025
henryL7/qwen2.5-7b-instruct-tulu-gen-dpo-offline-0.1 • Text Generation • 8B • Updated May 13, 2025 • 1
henryL7/qwen2.5-7b-instruct-tulu-gen-dpo-offline-0.05 • Text Generation • 8B • Updated May 13, 2025
henryL7/qwen2.5-7b-instruct-tulu-gen-dpo-offline-0.01 • Text Generation • 8B • Updated May 13, 2025
henryL7/qwen2.5-7b-instruct-tulu-gen-dpo-offline-0.005 • Text Generation • 8B • Updated May 13, 2025
henryL7/qwen2.5-7b-tulu-gen-eval-dpo-offline-0.2 • Text Generation • 8B • Updated May 13, 2025 • 1
henryL7/qwen2.5-7b-tulu-gen-eval-dpo-offline-0.1 • Text Generation • 8B • Updated May 13, 2025
henryL7/qwen2.5-7b-tulu-gen-eval-dpo-offline-0.005 • Text Generation • 8B • Updated May 13, 2025 • 2
henryL7/qwen2.5-7b-tulu-gen-eval-dpo-offline-0.01 • Text Generation • 8B • Updated May 13, 2025
henryL7/llama3.1-8b-tulu-gen-dpo-offline-0.5 • Text Generation • 8B • Updated May 12, 2025 • 3
henryL7/llama3.1-8b-tulu-gen-dpo-offline-0.2 • Text Generation • 8B • Updated May 12, 2025
henryL7/llama3.1-8b-tulu-gen-dpo-offline-0.1 • Text Generation • 8B • Updated May 12, 2025
henryL7/llama3.1-8b-tulu-gen-dpo-offline-0.005 • Text Generation • 8B • Updated May 12, 2025
henryL7/llama3.1-8b-tulu-gen-dpo-offline-0.01 • Text Generation • 8B • Updated May 12, 2025
henryL7/qwen2.5-7b-tulu-gen-dpo-offline-0.5 • Text Generation • 8B • Updated May 12, 2025
henryL7/qwen2.5-7b-tulu-gen-dpo-offline-0.2 • Text Generation • 8B • Updated May 12, 2025
henryL7/qwen2.5-7b-tulu-gen-dpo-offline-0.1 • Text Generation • 8B • Updated May 12, 2025
henryL7/qwen2.5-7b-tulu-gen-dpo-offline-0.005 • Text Generation • 8B • Updated May 12, 2025
henryL7/qwen2.5-7b-tulu-gen-dpo-offline-0.01 • Text Generation • 8B • Updated May 12, 2025
henryL7/ipo-qwen2.5-7b • Text Generation • 8B • Updated May 8, 2025
henryL7/sft-eval-qwen2.5-7b • Text Generation • 8B • Updated May 7, 2025
henryL7/sft-eval-llama3.1-8b • Text Generation • 8B • Updated May 7, 2025
henryL7/sft-llama3.1-8b • Text Generation • 8B • Updated May 7, 2025
henryL7/dpo-llama3.1-8b-0.5 • Text Generation • 8B • Updated May 7, 2025
henryL7/dpo-llama3.1-8b-0.2 • Text Generation • 8B • Updated May 7, 2025
henryL7/dpo-llama3.1-8b-0.1 • Text Generation • 8B • Updated May 7, 2025
henryL7/dpo-llama3.1-8b-0.05 • Text Generation • 8B • Updated May 7, 2025
henryL7/dpo-llama3.1-8b-0.02 • Text Generation • 8B • Updated May 7, 2025
henryL7/dpo-qwen2.5-7b-0.3 • Text Generation • 8B • Updated May 7, 2025