Latest run: caf56111-a9bd-42af-b1ea-66bdefad5886 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T09:41:34.206472+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| caf56111-a9bd-42af-b1ea-66bdefad5886 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T09:41:34.206472+00:00 |
| 1fa16e21-9bef-428f-ba6f-4ad21310dbd8 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T09:41:34.147646+00:00 |
| 7acc4d53-d6a7-4f5e-8dc7-2bab58c1af05 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T09:41:34.061504+00:00 |
| 9018536a-d9b7-4d9c-b368-1d2be3ae6073 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T09:41:33.986711+00:00 |
| b4ed4a80-5d7a-4fbd-b345-ba01d39f6257 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T09:41:33.892609+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| caf56111-a9bd-42af-b1ea-66bdefad5886 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T09:41:34.206472+00:00 |
| 1fa16e21-9bef-428f-ba6f-4ad21310dbd8 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T09:41:34.147646+00:00 |
| 7acc4d53-d6a7-4f5e-8dc7-2bab58c1af05 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T09:41:34.061504+00:00 |
| 9018536a-d9b7-4d9c-b368-1d2be3ae6073 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T09:41:33.986711+00:00 |
| b4ed4a80-5d7a-4fbd-b345-ba01d39f6257 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T09:41:33.892609+00:00 |
| 2368363c-3560-4afd-8f46-1a7dc668aa1d | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T09:41:33.833830+00:00 |
| 7a139f4c-3628-4b35-996e-cec1f4b211e5 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T09:41:33.765014+00:00 |
| 19e0f7b9-4ca3-48e8-95dc-e0acefc0a580 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T09:41:33.687445+00:00 |