Latest run: 5b2c40a9-f434-431f-9ce7-dbda397caadf | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T18:28:22.899929+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 5b2c40a9-f434-431f-9ce7-dbda397caadf | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:28:22.899929+00:00 |
| 3ecf6f32-ee7f-4232-b4b8-0fe84d51cb2d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:28:22.838798+00:00 |
| 5ba6f42e-80f1-4a9e-b82c-b6fe430c5f7b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:28:22.773096+00:00 |
| 398b7760-ba77-47c8-93dc-64b6169eaaea | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:28:22.717215+00:00 |
| 3ac42156-17ba-44e9-add1-174171040b9b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:28:22.653091+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 5b2c40a9-f434-431f-9ce7-dbda397caadf | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:28:22.899929+00:00 |
| 3ecf6f32-ee7f-4232-b4b8-0fe84d51cb2d | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:28:22.838798+00:00 |
| 5ba6f42e-80f1-4a9e-b82c-b6fe430c5f7b | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:28:22.773096+00:00 |
| 398b7760-ba77-47c8-93dc-64b6169eaaea | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:28:22.717215+00:00 |
| 3ac42156-17ba-44e9-add1-174171040b9b | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:28:22.653091+00:00 |
| 339df250-d008-47cf-a0e0-c4e8b86a4b4c | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:28:22.594187+00:00 |
| ac34ec7a-a6ab-4ffb-9f1b-15ee8c16ad7a | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:28:22.536140+00:00 |
| 65909779-d824-4911-9d0c-2290fbe902af | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:28:22.479287+00:00 |