Latest run: 1c7e3915-393d-47a1-9ed2-f3ccb6674ebf | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T03:50:00.932796+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 1c7e3915-393d-47a1-9ed2-f3ccb6674ebf | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T03:50:00.932796+00:00 |
| 8691e93e-4fff-4069-b071-0c14103b7719 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T03:50:00.860403+00:00 |
| f30607e0-cb8a-458c-b504-52281e41687f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T03:50:00.790360+00:00 |
| eb886834-e763-482c-a5ad-01d284c5e022 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T03:50:00.720962+00:00 |
| e8fd8454-2a1f-46a6-8f93-e0745799f5e1 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T03:50:00.648715+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 1c7e3915-393d-47a1-9ed2-f3ccb6674ebf | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:50:00.932796+00:00 |
| 8691e93e-4fff-4069-b071-0c14103b7719 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T03:50:00.860403+00:00 |
| f30607e0-cb8a-458c-b504-52281e41687f | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T03:50:00.790360+00:00 |
| eb886834-e763-482c-a5ad-01d284c5e022 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:50:00.720962+00:00 |
| e8fd8454-2a1f-46a6-8f93-e0745799f5e1 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:50:00.648715+00:00 |
| 121edb22-c1fe-44c7-ad31-61efbdd25b6a | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:50:00.569721+00:00 |
| 7482a239-d522-467b-9d6a-2a6f65f94644 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:50:00.509532+00:00 |
| ad98a746-9189-4ac4-94e2-d94648a1d128 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T03:50:00.437176+00:00 |