fix: match reference project structure to resolve platform validation 0fd745c padmapriyagosakan committed on Apr 12
Fix: inference.py always exits 0, fixes chain-gate loop, adds full traceback logging 2babded padmapriyagosakan committed on Apr 9
Fix: use 'or' fallback for MODEL_NAME/API_BASE_URL to handle empty-string env vars from evaluator fe9a2be padmapriyagosakan committed on Apr 9
Fix false-positive env var validation: check resolved Python vars, not os.environ directly 82755c0 padmapriyagosakan committed on Apr 9
Set default API_BASE_URL and MODEL_NAME to match sample_interface.py ecb5a90 padmapriyagosakan committed on Apr 4
fix: align step_reward with grade_episode, pin deps, update docs, clean inference 3f78483 padmapriyagosakan committed on Mar 31
feat: enforce investigation discipline + fix easy-task grading + add investigation_hints 622e841 padmapriyagosakan committed on Mar 30
feat: reproducible baseline — fixed seed, correct_action in table, accuracy summary 9ec66a4 padmapriyagosakan committed on Mar 30
fix: per-task reward display uses weighted_reward key from grader 2cf9fa0 padmapriyagosakan committed on Mar 30
feat: run LLM inference baseline + fix SSL and loop guard in inference.py c0df82b padmapriyagosakan committed on Mar 30
feat: Iteration_3 — openenv validate PASS, wire compat, PayOpsReward, OPENAI_API_KEY- Fix completed, Docker verification alone pending- Padma 279779a padmapriyagosakan committed on Mar 30
chore: pre-iteration-1 snapshot — 75/75 e2e passing, baseline bug fixed 0f139ff padmapriyagosakan committed on Mar 27
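
Commits fe9a2be and 82755c0 both concern environment variables that are set but empty. The sketch below is one possible reading of that pattern, not the repository's actual code: the helper names `resolve` and `missing_settings` and both default values are hypothetical, since the log does not show the real ones.

```python
import os

# Hypothetical defaults for illustration; the real values are not shown in this log.
DEFAULT_API_BASE_URL = "https://example.invalid/v1"
DEFAULT_MODEL_NAME = "example-model"


def resolve(name, default, env=None):
    """Return the variable's value, or default when it is unset or empty."""
    env = os.environ if env is None else env
    # env.get(name, default) would return "" for a variable set to an empty
    # string; `or` treats "" as missing and falls through to the default.
    return env.get(name) or default


def missing_settings(resolved):
    """Validate the resolved values rather than os.environ directly."""
    return [key for key, value in resolved.items() if not value]


if __name__ == "__main__":
    settings = {
        "API_BASE_URL": resolve("API_BASE_URL", DEFAULT_API_BASE_URL),
        "MODEL_NAME": resolve("MODEL_NAME", DEFAULT_MODEL_NAME),
    }
    print(missing_settings(settings))
```

Checking the resolved values avoids reporting a variable as missing when a default has already filled it in, which is the kind of false positive 82755c0 describes.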