Latest run: 3f0eb62d-c10d-4f13-a340-052437c683e6 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T18:23:35.906742+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 3f0eb62d-c10d-4f13-a340-052437c683e6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:23:35.906742+00:00 |
| c9c1639f-cb9d-4f98-8ac1-2e6510a2ad3a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:23:35.858102+00:00 |
| 6d117bdf-414a-4025-a8e6-57a65aa301aa | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:23:35.800615+00:00 |
| 45951227-4fe0-4020-aa1e-9614abac44aa | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:23:35.735194+00:00 |
| be5249c0-7da3-4518-96ff-2779ae2391ff | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:23:35.685491+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 3f0eb62d-c10d-4f13-a340-052437c683e6 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:23:35.906742+00:00 |
| c9c1639f-cb9d-4f98-8ac1-2e6510a2ad3a | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:23:35.858102+00:00 |
| 6d117bdf-414a-4025-a8e6-57a65aa301aa | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:23:35.800615+00:00 |
| 45951227-4fe0-4020-aa1e-9614abac44aa | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:23:35.735194+00:00 |
| be5249c0-7da3-4518-96ff-2779ae2391ff | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:23:35.685491+00:00 |
| 352dc04c-ed10-4d2a-a0b6-6549712ef9ce | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:23:35.619646+00:00 |
| bab825e7-f6ab-4c4e-bc97-be8d138470f3 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:23:35.567145+00:00 |
| 0e80f78e-54d4-4522-98bf-3ac1923bd571 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:23:35.516237+00:00 |