dashboard / patch_orig_analysis_with_eval.py

Commit History

Show incomplete runs as incorrect; fix missing questions via BrowseComp JSONL fallback
1eb493c

timchen0618 commited on

Add question/answer/accuracy to Scout Runs tab; fix selected-tools reload cache
30fb9c5

timchen0618 commited on