Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
3
Yifan Mai
yifanmai
Follow
evijit's profile picture
21world's profile picture
yjernite's profile picture
3 followers
·
2 following
yifanmai
AI & ML interests
None yet
Recent Activity
new
activity
5 days ago
evaleval/EEE_datastore:
Add HELM Safety v1.17.0 results
authored
a paper
8 days ago
VHELM: A Holistic Evaluation of Vision Language Models
authored
a paper
8 days ago
AIR-Bench 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies
View all activity
Organizations
Articles
1
Article
7
AI evals are becoming the new compute bottleneck
Papers
12
arxiv:
2511.20836
arxiv:
2510.11977
arxiv:
2508.21376
arxiv:
2505.21972
Expand 12 papers
models
0
None public yet
datasets
3
Sort: Recently updated
yifanmai/arabic-enterprise
Viewer
•
Updated
20 days ago
•
721
•
25
yifanmai/czech_bank_qa
Viewer
•
Updated
Dec 19, 2024
•
132
•
690
yifanmai/call-center
Viewer
•
Updated
Aug 28, 2024
•
725
•
3
•
4