Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yifan Mai's picture
3

Yifan Mai

yifanmai
evijit's profile picture 21world's profile picture yjernite's profile picture
·
  • yifanmai

AI & ML interests

None yet

Recent Activity

new activity 5 days ago
evaleval/EEE_datastore:Add HELM Safety v1.17.0 results
authored a paper 8 days ago
VHELM: A Holistic Evaluation of Vision Language Models
authored a paper 8 days ago
AIR-Bench 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies
View all activity

Organizations

Stanford CRFM's profile picture EvalEval Coalition's profile picture

Articles 1

Article
7

AI evals are becoming the new compute bottleneck

Papers 12

arxiv:2511.20836
arxiv:2510.11977
arxiv:2508.21376
arxiv:2505.21972

models 0

None public yet

datasets 3

yifanmai/arabic-enterprise

Viewer • Updated 20 days ago • 721 • 25

yifanmai/czech_bank_qa

Viewer • Updated Dec 19, 2024 • 132 • 690

yifanmai/call-center

Viewer • Updated Aug 28, 2024 • 725 • 3 • 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs