Latest run: f9c3be9c-23d4-4f33-8f6f-1f1cf3ea2065 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-07T22:54:09.197197+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| f9c3be9c-23d4-4f33-8f6f-1f1cf3ea2065 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T22:54:09.197197+00:00 |
| 11d85bb4-2b31-447a-bc5d-9f95fbd1aaa8 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T22:54:09.098346+00:00 |
| e2a95322-4258-440f-8b64-d0ae8c93d75a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T22:54:09.032891+00:00 |
| 5398a66e-9a5d-4d40-8f5a-b5499f2874e2 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T22:54:08.948443+00:00 |
| 6df3f3ad-c3ae-4cf1-b36e-1709cc0e3085 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T22:54:08.888320+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| f9c3be9c-23d4-4f33-8f6f-1f1cf3ea2065 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-07T22:54:09.197197+00:00 |
| 11d85bb4-2b31-447a-bc5d-9f95fbd1aaa8 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T22:54:09.098346+00:00 |
| e2a95322-4258-440f-8b64-d0ae8c93d75a | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T22:54:09.032891+00:00 |
| 5398a66e-9a5d-4d40-8f5a-b5499f2874e2 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-07T22:54:08.948443+00:00 |
| 6df3f3ad-c3ae-4cf1-b36e-1709cc0e3085 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-07T22:54:08.888320+00:00 |
| 81fbc9d6-2181-466c-9d62-1e4d0bb1ee68 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-07T22:54:08.805453+00:00 |
| ba2a4b88-3a50-48cd-b8f1-32b5c9e63af2 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-07T22:54:08.738099+00:00 |
| 19c5190d-afc7-4e4e-96e7-a162c11ab8bb | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T22:54:08.672158+00:00 |