Latest run: 839fab2d-ad51-4d5b-9258-80e7c0efda66 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-04-27T16:31:42.618986+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 839fab2d-ad51-4d5b-9258-80e7c0efda66 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:31:42.618986+00:00 |
| 507ed397-3d53-477b-86a9-e67aaba27f3e | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:31:42.360502+00:00 |
| 1218a675-cbc6-4ce9-a38c-2f9e2c73dc02 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:31:42.264818+00:00 |
| bf8ac965-4e3a-4c04-9987-3b03094f34e5 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:31:42.173783+00:00 |
| 7d6b3d09-1d78-473e-ba8d-6075f32f0ee2 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:31:42.090176+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 839fab2d-ad51-4d5b-9258-80e7c0efda66 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:42.618986+00:00 |
| 507ed397-3d53-477b-86a9-e67aaba27f3e | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:31:42.360502+00:00 |
| 1218a675-cbc6-4ce9-a38c-2f9e2c73dc02 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:31:42.264818+00:00 |
| bf8ac965-4e3a-4c04-9987-3b03094f34e5 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:42.173783+00:00 |
| 7d6b3d09-1d78-473e-ba8d-6075f32f0ee2 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:42.090176+00:00 |
| 83964b84-bef1-4cb7-a92c-82430efe3850 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:41.921937+00:00 |
| 748ea68c-5341-4a44-95bd-a0386c88272e | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:31:41.784058+00:00 |
| 84bc1874-9768-4218-9bad-5591e4e9440c | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:31:41.691992+00:00 |