Latest run: 497d957e-d315-4744-8d98-579c7584b229 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-04-27T16:58:24.744891+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 497d957e-d315-4744-8d98-579c7584b229 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:58:24.744891+00:00 |
| e3689825-4f87-4c2b-b913-f108f42ae820 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:58:24.691747+00:00 |
| bec27586-9f81-4cfe-ad3e-0a079a2c9410 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:58:24.618716+00:00 |
| e85a7890-247f-4cd8-b142-6458f5e3a9a9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:58:24.550182+00:00 |
| c68537f1-b9ab-4c18-96ee-8705bdabe47a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-27T16:58:24.483248+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 497d957e-d315-4744-8d98-579c7584b229 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:58:24.744891+00:00 |
| e3689825-4f87-4c2b-b913-f108f42ae820 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:58:24.691747+00:00 |
| bec27586-9f81-4cfe-ad3e-0a079a2c9410 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:58:24.618716+00:00 |
| e85a7890-247f-4cd8-b142-6458f5e3a9a9 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:58:24.550182+00:00 |
| c68537f1-b9ab-4c18-96ee-8705bdabe47a | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:58:24.483248+00:00 |
| f01c386f-9a89-48ef-857d-52822937679d | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:58:24.400085+00:00 |
| 17aa54ff-7502-466a-8682-c098526203f0 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-04-27T16:58:24.330134+00:00 |
| 6d424f7f-5fd8-46b0-9218-376d0a506fa6 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:58:24.265264+00:00 |