Latest run: e24722e1-9bfc-460e-b35c-b79c68351e3e | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T14:21:45.766120+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| e24722e1-9bfc-460e-b35c-b79c68351e3e | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T14:21:45.766120+00:00 |
| 4ad4c11d-82b3-4126-87d3-e4144d95d37b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T14:21:45.691938+00:00 |
| 92d63643-1a5b-4ec0-9221-eabf5c8097ec | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T14:21:45.565637+00:00 |
| faac229a-2bcd-41c1-a298-2d809bb63af7 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T14:21:45.477332+00:00 |
| 1d29a647-d1ca-4711-a40f-de51b89ae486 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T14:21:45.368061+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| e24722e1-9bfc-460e-b35c-b79c68351e3e | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:21:45.766120+00:00 |
| 4ad4c11d-82b3-4126-87d3-e4144d95d37b | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T14:21:45.691938+00:00 |
| 92d63643-1a5b-4ec0-9221-eabf5c8097ec | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T14:21:45.565637+00:00 |
| faac229a-2bcd-41c1-a298-2d809bb63af7 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:21:45.477332+00:00 |
| 1d29a647-d1ca-4711-a40f-de51b89ae486 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:21:45.368061+00:00 |
| 1afb36a4-fd42-4841-ae09-4111a09d5470 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:21:45.277672+00:00 |
| e5b28c0c-4867-4e0c-843b-fcc2d4d48d34 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:21:45.174470+00:00 |
| aece2191-d52c-4835-b509-1a27e8daf5e6 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T14:21:45.037117+00:00 |