Latest run: 5f4788f4-890e-414e-b4da-4d37f5bab01c | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-04-27T16:31:02.363820+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 5f4788f4-890e-414e-b4da-4d37f5bab01c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:31:02.363820+00:00 |
| d48e616d-792d-4876-89df-32fec2f86616 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:31:02.278168+00:00 |
| 28cf1d16-73bf-4034-a073-e7d653b7915a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:31:02.146551+00:00 |
| 53b8cfba-9e79-49b3-9024-c5b92ba2b683 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:31:02.059349+00:00 |
| 86f77de9-5045-4dcd-b3da-a823966cce1a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:31:01.947411+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 5f4788f4-890e-414e-b4da-4d37f5bab01c | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:02.363820+00:00 |
| d48e616d-792d-4876-89df-32fec2f86616 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:31:02.278168+00:00 |
| 28cf1d16-73bf-4034-a073-e7d653b7915a | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:31:02.146551+00:00 |
| 53b8cfba-9e79-49b3-9024-c5b92ba2b683 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:02.059349+00:00 |
| 86f77de9-5045-4dcd-b3da-a823966cce1a | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:01.947411+00:00 |
| 0c224e69-d0d5-4d62-9948-c2b919920f49 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:01.857525+00:00 |
| dd8cc810-8960-4ebf-8fe5-bfe32e22755b | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:01.775518+00:00 |
| bf6997e1-ad91-495c-a91e-0a2c91d1146a | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:31:01.681210+00:00 |