Latest run: 3e509c3d-c11b-4ff2-9d39-34a58c668eb6 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T19:48:37.054196+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 3e509c3d-c11b-4ff2-9d39-34a58c668eb6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:48:37.054196+00:00 |
| b891a5b8-a4e0-488d-8ec2-b923d53f7467 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T19:48:36.969058+00:00 |
| 05abd959-7108-47f3-ba88-194ae382ed0c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:48:36.891929+00:00 |
| 2f32cd06-19f3-44fc-8bc8-da091eea1664 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T19:48:36.810770+00:00 |
| 1f819e2f-79a2-45e3-ad4e-d916e69961ec | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:48:36.741348+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 3e509c3d-c11b-4ff2-9d39-34a58c668eb6 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:48:37.054196+00:00 |
| b891a5b8-a4e0-488d-8ec2-b923d53f7467 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:48:36.969058+00:00 |
| 05abd959-7108-47f3-ba88-194ae382ed0c | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:48:36.891929+00:00 |
| 2f32cd06-19f3-44fc-8bc8-da091eea1664 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:48:36.810770+00:00 |
| 1f819e2f-79a2-45e3-ad4e-d916e69961ec | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:48:36.741348+00:00 |
| 783daa86-3f91-45c7-af68-1febdeadc7db | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:48:36.645885+00:00 |
| cdabcd39-6336-49bd-87d2-4a6f0dceb1b8 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:48:36.541055+00:00 |
| 71474029-10cc-405f-b0aa-98b06efab531 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:48:36.477944+00:00 |