Latest run: 8d9ed995-db4b-49ca-94c7-70ee4a5250f6 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T02:19:51.630593+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 8d9ed995-db4b-49ca-94c7-70ee4a5250f6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T02:19:51.630593+00:00 |
| 8e12d08d-6922-4ce2-80a6-a1cc6fba15ee | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T02:19:51.543366+00:00 |
| 62df596a-888c-47d2-882f-60ca117e011b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T02:19:51.473797+00:00 |
| fb3aada7-5e00-486c-90b8-00bb43942f60 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T02:19:51.350846+00:00 |
| 4f1eff05-a6d3-4832-8b89-7453645d5992 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T02:19:51.188586+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 8d9ed995-db4b-49ca-94c7-70ee4a5250f6 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T02:19:51.630593+00:00 |
| 8e12d08d-6922-4ce2-80a6-a1cc6fba15ee | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T02:19:51.543366+00:00 |
| 62df596a-888c-47d2-882f-60ca117e011b | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T02:19:51.473797+00:00 |
| fb3aada7-5e00-486c-90b8-00bb43942f60 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T02:19:51.350846+00:00 |
| 4f1eff05-a6d3-4832-8b89-7453645d5992 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T02:19:51.188586+00:00 |
| c017d997-38ff-41b8-a337-7614a9857b65 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T02:19:51.009645+00:00 |
| c85720d7-fb13-4634-8c17-9dcab3b18f71 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T02:19:50.889334+00:00 |
| e5e40f0c-e836-4ad2-ae5b-35d32dcdfeae | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T02:19:50.755737+00:00 |