Latest run: b79f2497-af82-48e7-bc5d-adf2f6dbdd82 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-04-27T16:59:35.717526+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| b79f2497-af82-48e7-bc5d-adf2f6dbdd82 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:59:35.717526+00:00 |
| 35d9af66-31df-432f-9296-15d90c2e901f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:59:35.619848+00:00 |
| 34990051-70f1-4e8f-908c-e4453862e313 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:59:35.557560+00:00 |
| ac749e2d-da1a-4520-9d33-f7eb7ed7c91d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:59:35.503235+00:00 |
| 6ca49e3d-33eb-4aeb-abc9-bd07e50c7d11 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:59:35.426241+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| b79f2497-af82-48e7-bc5d-adf2f6dbdd82 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:59:35.717526+00:00 |
| 35d9af66-31df-432f-9296-15d90c2e901f | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:35.619848+00:00 |
| 34990051-70f1-4e8f-908c-e4453862e313 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:35.557560+00:00 |
| ac749e2d-da1a-4520-9d33-f7eb7ed7c91d | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:59:35.503235+00:00 |
| 6ca49e3d-33eb-4aeb-abc9-bd07e50c7d11 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:59:35.426241+00:00 |
| d53de61b-dc40-464f-a1fc-b4389887c2c9 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:59:35.370612+00:00 |
| dd863ac2-172f-44ef-bfd3-ce347ba41d28 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:59:35.316539+00:00 |
| 87c66d09-2ebc-40d8-96ef-66e63f582a8f | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:35.255931+00:00 |