Latest run: 073ad34f-6bc8-4e71-a09a-1b857ff873c2 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T18:33:07.562367+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 073ad34f-6bc8-4e71-a09a-1b857ff873c2 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T18:33:07.562367+00:00 |
| 6c778a63-7b3b-46c1-ab33-1456e760e4f4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T18:33:07.519019+00:00 |
| 79af98d8-8258-4e0e-8fe9-389713fe7a93 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T18:33:07.455624+00:00 |
| a6de0837-07c1-4b54-b5b9-f7dae3a66064 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T18:33:07.383051+00:00 |
| 8bd308ab-22e0-46dd-9dd5-d8515c5bdc12 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T18:33:07.330770+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 073ad34f-6bc8-4e71-a09a-1b857ff873c2 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T18:33:07.562367+00:00 |
| 6c778a63-7b3b-46c1-ab33-1456e760e4f4 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T18:33:07.519019+00:00 |
| 79af98d8-8258-4e0e-8fe9-389713fe7a93 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T18:33:07.455624+00:00 |
| a6de0837-07c1-4b54-b5b9-f7dae3a66064 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T18:33:07.383051+00:00 |
| 8bd308ab-22e0-46dd-9dd5-d8515c5bdc12 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T18:33:07.330770+00:00 |
| 3cb64684-b25c-4f95-a3ab-bf46dba156da | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T18:33:07.274594+00:00 |
| 04c3b62c-7b8d-4dd9-a0fc-07ed4eb5a0f3 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T18:33:07.226157+00:00 |
| 5bbafe73-49f5-4553-9790-4fdfc521891f | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T18:33:07.169468+00:00 |