Latest run: 8663d562-c186-427d-a4a1-1deee6d7801a | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-08T00:44:16.500153+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 8663d562-c186-427d-a4a1-1deee6d7801a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T00:44:16.500153+00:00 |
| bb724fd9-bfa1-4b31-a60b-91f2621f8021 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T00:44:16.431564+00:00 |
| 2b666102-153b-4d7b-9a29-e6e6682d3bf4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T00:44:16.364116+00:00 |
| 821014a0-9cd4-4338-8a47-cde05890b363 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T00:44:16.311153+00:00 |
| 0183f6b2-ff0d-4675-b1ac-a8ec5b476d4b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T00:44:16.256482+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 8663d562-c186-427d-a4a1-1deee6d7801a | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-08T00:44:16.500153+00:00 |
| bb724fd9-bfa1-4b31-a60b-91f2621f8021 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T00:44:16.431564+00:00 |
| 2b666102-153b-4d7b-9a29-e6e6682d3bf4 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T00:44:16.364116+00:00 |
| 821014a0-9cd4-4338-8a47-cde05890b363 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-08T00:44:16.311153+00:00 |
| 0183f6b2-ff0d-4675-b1ac-a8ec5b476d4b | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-08T00:44:16.256482+00:00 |
| bff6e745-870e-4f36-ae38-5a8d475f09ed | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-08T00:44:16.187976+00:00 |
| 7c7f945c-3707-44d8-bfa7-43bc7c279368 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-08T00:44:16.124475+00:00 |
| 0a5ae992-ae38-4857-8423-3887af6207e2 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T00:44:16.050681+00:00 |