Latest run: 548e9c71-ef2b-4de2-9a68-971b458b11b5 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-08T21:30:20.992553+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 548e9c71-ef2b-4de2-9a68-971b458b11b5 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T21:30:20.992553+00:00 |
| c53135bf-27be-452a-a549-752e2c4ebaa0 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T21:30:20.887644+00:00 |
| d2d18016-0c68-4289-afee-fa1719c05c80 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T21:30:20.787030+00:00 |
| 9c218c63-5661-411a-87a8-1923a916cee9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T21:30:20.701250+00:00 |
| afc76d08-dbab-4588-a634-856a8b50c5dd | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T21:30:20.609170+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 548e9c71-ef2b-4de2-9a68-971b458b11b5 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-08T21:30:20.992553+00:00 |
| c53135bf-27be-452a-a549-752e2c4ebaa0 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:30:20.887644+00:00 |
| d2d18016-0c68-4289-afee-fa1719c05c80 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:30:20.787030+00:00 |
| 9c218c63-5661-411a-87a8-1923a916cee9 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-08T21:30:20.701250+00:00 |
| afc76d08-dbab-4588-a634-856a8b50c5dd | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-08T21:30:20.609170+00:00 |
| ca31763b-837c-4128-b49f-4edf915506e3 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-08T21:30:20.503866+00:00 |
| 9f8fc943-605c-4cd3-bfdd-f8eaa90377b3 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-08T21:30:20.420227+00:00 |
| bb80e034-5d69-45d7-82ad-ee17c61c4516 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:30:20.338071+00:00 |