Latest run: 59a86dbe-e69c-45a1-a91c-5f2dd1facd2f | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-22T13:43:12.197678+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 59a86dbe-e69c-45a1-a91c-5f2dd1facd2f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T13:43:12.197678+00:00 |
| e208d309-d6d6-4b06-9433-fd64249ea983 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-22T13:43:12.129405+00:00 |
| 94c06b37-f86c-464f-95f6-01760f64fa66 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T13:43:12.047611+00:00 |
| 34eadd85-8bdf-4838-9413-66fd6bceaa1e | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-22T13:43:11.963076+00:00 |
| 497e04ab-63fa-40d1-a0c6-03f373595425 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T13:43:11.865224+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 59a86dbe-e69c-45a1-a91c-5f2dd1facd2f | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:43:12.197678+00:00 |
| e208d309-d6d6-4b06-9433-fd64249ea983 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T13:43:12.129405+00:00 |
| 94c06b37-f86c-464f-95f6-01760f64fa66 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:43:12.047611+00:00 |
| 34eadd85-8bdf-4838-9413-66fd6bceaa1e | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T13:43:11.963076+00:00 |
| 497e04ab-63fa-40d1-a0c6-03f373595425 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:43:11.865224+00:00 |
| 8dc40c69-cfee-48a7-a915-43c6a111e941 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:43:11.766318+00:00 |
| eb908e13-a2f3-4081-8418-4c73ee43e1dc | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:43:11.677706+00:00 |
| 663a7916-a83f-408d-8337-31deaa477a6e | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T13:43:11.583418+00:00 |