Latest run: 2796168f-2df4-4ae4-9a50-d5238a425ced | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T18:44:38.483817+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 2796168f-2df4-4ae4-9a50-d5238a425ced | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:44:38.483817+00:00 |
| c111d14a-495d-4072-a64e-72321e6d4218 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:44:38.372979+00:00 |
| 90c0153d-87ed-414e-a8c7-b3c292d0ec52 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:44:38.262103+00:00 |
| e39e35d7-178f-4766-845e-4bb9896f1485 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:44:38.150512+00:00 |
| a84aa12f-8cb3-4f68-8a4b-277b0ae28316 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:44:38.039060+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 2796168f-2df4-4ae4-9a50-d5238a425ced | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:44:38.483817+00:00 |
| c111d14a-495d-4072-a64e-72321e6d4218 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:44:38.372979+00:00 |
| 90c0153d-87ed-414e-a8c7-b3c292d0ec52 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:44:38.262103+00:00 |
| e39e35d7-178f-4766-845e-4bb9896f1485 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:44:38.150512+00:00 |
| a84aa12f-8cb3-4f68-8a4b-277b0ae28316 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:44:38.039060+00:00 |
| fc7ba846-f33e-45c1-8dcd-a50b6bea04f6 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:44:37.927507+00:00 |
| 0e0e9f4f-0b8d-400c-be54-5c2b3b0721d2 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:44:37.816357+00:00 |
| f2b5382b-f01f-4beb-af70-74bd22f73b9c | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:44:37.705573+00:00 |