Latest run: 087001a7-736a-41bd-8a89-06144bbf8b67 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-10T00:42:51.755490+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 087001a7-736a-41bd-8a89-06144bbf8b67 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-10T00:42:51.755490+00:00 |
| fcd7ea4c-a51e-48c6-af6e-ffe317008ae4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-10T00:42:51.702394+00:00 |
| 8820759d-b387-442f-86df-693cd4a3b2af | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-10T00:42:51.637148+00:00 |
| 1ccc0ceb-c281-4a3b-af01-abf5f9a3ec51 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-10T00:42:51.574653+00:00 |
| feb4e94f-4e36-4637-9e75-40c43d9d83cb | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-10T00:42:51.468307+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 087001a7-736a-41bd-8a89-06144bbf8b67 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-10T00:42:51.755490+00:00 |
| fcd7ea4c-a51e-48c6-af6e-ffe317008ae4 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-10T00:42:51.702394+00:00 |
| 8820759d-b387-442f-86df-693cd4a3b2af | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-10T00:42:51.637148+00:00 |
| 1ccc0ceb-c281-4a3b-af01-abf5f9a3ec51 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-10T00:42:51.574653+00:00 |
| feb4e94f-4e36-4637-9e75-40c43d9d83cb | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-10T00:42:51.468307+00:00 |
| 43c0c995-79ce-49c5-9eba-89f9f48ed3d6 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-10T00:42:51.286281+00:00 |
| 735951f3-28b1-45c2-8780-facecf3ad909 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-10T00:42:51.215923+00:00 |
| 92bbdcb4-b400-4393-8f3c-0509753c7ae7 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-10T00:42:51.157931+00:00 |