Latest run: be55098f-7a99-42a2-80fb-467b813a7f2d | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-10T02:43:36.347901+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| be55098f-7a99-42a2-80fb-467b813a7f2d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-10T02:43:36.347901+00:00 |
| cba8dc7a-b1e7-4794-b0d8-ab6fd66bafdc | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-10T02:43:36.282240+00:00 |
| 9c00ea5d-369d-49b4-8c46-962ffa360838 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-10T02:43:36.203848+00:00 |
| ed4b247f-4522-4c9f-ab51-739648e690b0 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-10T02:43:36.152758+00:00 |
| 1d67dee0-85dc-4c3c-a3b8-1783d8c49df9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-10T02:43:36.101829+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| be55098f-7a99-42a2-80fb-467b813a7f2d | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-10T02:43:36.347901+00:00 |
| cba8dc7a-b1e7-4794-b0d8-ab6fd66bafdc | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-10T02:43:36.282240+00:00 |
| 9c00ea5d-369d-49b4-8c46-962ffa360838 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-10T02:43:36.203848+00:00 |
| ed4b247f-4522-4c9f-ab51-739648e690b0 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-10T02:43:36.152758+00:00 |
| 1d67dee0-85dc-4c3c-a3b8-1783d8c49df9 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-10T02:43:36.101829+00:00 |
| bc93478a-525b-41d5-99dc-e170e2f6a855 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-10T02:43:36.043088+00:00 |
| 8e21870d-ec13-445c-8c20-9749b9a6f43c | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-10T02:43:35.977623+00:00 |
| 64f864a9-080b-4859-8d9c-9cd3a9f6b976 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-10T02:43:35.924728+00:00 |