Tommaso Cerruti
Cerru02
AI & ML interests
None yet
Recent Activity
new activity 1 day ago
evaleval/EEE_datastore:Update HELM to schema version v0.2.2
authored a paper 2 days ago
CocoaBench: Evaluating Unified Digital Agents in the Wild
upvoted an article 3 days ago
AI evals are becoming the new compute bottleneck