EvalEval Coalition

Team
community
https://evalevalai.com/
evaluatingevals
evaleval

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! Hosted by Hugging Face, University of Edinburgh, and EleutherAI.

Recent Activity

EvalEvalBot  new activity about 6 hours ago
evaleval/EEE_datastore:[Submission] Journalistic-Bias
deepmage121  new activity about 6 hours ago
evaleval/EEE_datastore:Repair HF PR #26 alphaXiv data to strict schema and canonical identity
EvalEvalBot  new activity about 6 hours ago
evaleval/EEE_datastore:Update HELM to schema version v0.2.2

Articles

AI evals are becoming the new compute bottleneck

1 day ago • 11

Yacine Jernite, Irene Solaiman, Canyu Chen, Felix Friedrich, Alina Leidinger, Margaret Mitchell, Jennifer Mickel, Usman Gohar, Levent Sagun, Shubham Singh, Avijit Ghosh, Leshem Choshen, Aurélien-Morgan CLAUDON, Amita Shukla, Prajna Soni, Anshuman Suri, Joseph [open/acc] Pollack, Mowafak Allaham, wave, Ali El Filali, Andrew Tran, Monojit, Kevin Wei, Jan Batzner, Jenny Chim, Mubashara Akhtar, Sree Harsha Nelaturu, Hossein A. (Saeed) Rahmani, Abdul Muhsin Hameed, Srishti, Joshua Noble, EvalEval Bot, Damian Stachura, Šimon Podhajský, Anastassia Kornilova, Inge V, Aris, Sriram Mohan, Tommaso Cerruti, ImamaShehzad, Marek Suppa, Yifan Mai, Georgia Channing

evaleval's Spaces 4

Running
4

Eval Cards

📋

Standardized evaluation cards for AI models and benchmarks

about 8 hours ago
Running

eval-card-registry

🗂

about 9 hours ago
Running

BenchmarkCard Webhook

📋

Receive and process benchmark data via webhook

about 19 hours ago
Running

README

🤗

Sep 9, 2025