Latest run: 7004b004-e8f2-4d5a-9430-bcd8e01c2e50 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-04-27T16:19:23.994747+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 7004b004-e8f2-4d5a-9430-bcd8e01c2e50 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:19:23.994747+00:00 |
| de13f802-0e37-4a61-bc4b-231c904d1b6c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:19:23.832066+00:00 |
| d56a5b7c-32d2-4fe9-930b-618345265f9c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:19:23.742532+00:00 |
| 40b4c67f-22f7-4b80-9a44-647285729a66 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:19:23.665995+00:00 |
| 138e6d81-c39d-45b6-8f0a-1f54d5ef84f8 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:19:23.599118+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 7004b004-e8f2-4d5a-9430-bcd8e01c2e50 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:19:23.994747+00:00 |
| de13f802-0e37-4a61-bc4b-231c904d1b6c | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:19:23.832066+00:00 |
| d56a5b7c-32d2-4fe9-930b-618345265f9c | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:19:23.742532+00:00 |
| 40b4c67f-22f7-4b80-9a44-647285729a66 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:19:23.665995+00:00 |
| 138e6d81-c39d-45b6-8f0a-1f54d5ef84f8 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:19:23.599118+00:00 |
| de2fea17-f6c9-4c8c-9f21-209ccdc45e9e | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:19:23.536001+00:00 |
| 9b695874-530b-4a04-87a2-cd001e842cc1 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:19:23.470781+00:00 |
| 7033160b-8101-468f-b02d-168c01f77718 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:19:23.396491+00:00 |