Latest run: eb1416e1-b482-46df-9ad6-ed591891876f | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-07T23:11:04.859143+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| eb1416e1-b482-46df-9ad6-ed591891876f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:11:04.859143+00:00 |
| 8a3d77b5-8166-4a1b-a3d6-3b1893ee72ce | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:11:04.742427+00:00 |
| f44373ec-c6cf-4bda-a03a-6ba0a2c4d332 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:11:04.628948+00:00 |
| 11afc00b-176f-42ae-8d47-7b26a77d082c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:11:04.515904+00:00 |
| 5bd78aea-01c2-4f8f-b473-52b4ab79da39 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:11:04.401030+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| eb1416e1-b482-46df-9ad6-ed591891876f | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:11:04.859143+00:00 |
| 8a3d77b5-8166-4a1b-a3d6-3b1893ee72ce | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:11:04.742427+00:00 |
| f44373ec-c6cf-4bda-a03a-6ba0a2c4d332 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:11:04.628948+00:00 |
| 11afc00b-176f-42ae-8d47-7b26a77d082c | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:11:04.515904+00:00 |
| 5bd78aea-01c2-4f8f-b473-52b4ab79da39 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:11:04.401030+00:00 |
| d24e01e7-ed4b-4d54-bcfd-0bdc1816dccb | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:11:04.292112+00:00 |
| e7ee9eab-4417-44d5-bb8f-52658015c50f | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:11:04.179294+00:00 |
| 0a554670-31a3-4801-b3dd-aa787db257e4 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:11:04.067439+00:00 |