EvalEval Coalition

community

https://evalevalai.com/

evaluatingevals


AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval), hosted by Hugging Face, the University of Edinburgh, and EleutherAI.

Recent Activity

EvalEvalBot new activity about 6 hours ago

evaleval/EEE_datastore: [Submission] Journalistic-Bias

deepmage121 new activity about 6 hours ago

evaleval/EEE_datastore: Repair HF PR #26 alphaXiv data to strict schema and canonical identity

EvalEvalBot new activity about 6 hours ago

evaleval/EEE_datastore: Update HELM to schema version v0.2.2


Articles

AI evals are becoming the new compute bottleneck

evaleval's Spaces (4)

Eval Cards

Standardized evaluation cards for AI models and benchmarks

eval-card-registry

BenchmarkCard Webhook

Receive and process benchmark data via webhook

README