Latest run: 985ae896-4b4b-4069-8575-519e39610e9a | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-04-28T00:49:28.802981+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 985ae896-4b4b-4069-8575-519e39610e9a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:49:28.802981+00:00 |
| 3fe16945-275b-49b1-b3bc-3fc82f696ced | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:49:28.743953+00:00 |
| eeec17eb-df4a-4557-a0e3-7c904b79ff2c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:49:28.670226+00:00 |
| 8fa0ff25-6329-428d-92f1-12b993d7ba2a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:49:28.603455+00:00 |
| 28538792-b660-486c-9b77-66f46290c5d0 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:49:28.539356+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 985ae896-4b4b-4069-8575-519e39610e9a | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:49:28.802981+00:00 |
| 3fe16945-275b-49b1-b3bc-3fc82f696ced | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:49:28.743953+00:00 |
| eeec17eb-df4a-4557-a0e3-7c904b79ff2c | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:49:28.670226+00:00 |
| 8fa0ff25-6329-428d-92f1-12b993d7ba2a | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:49:28.603455+00:00 |
| 28538792-b660-486c-9b77-66f46290c5d0 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:49:28.539356+00:00 |
| 6bac7b26-cb6b-49f8-af77-fbc3e565e53f | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:49:28.468790+00:00 |
| af252f76-2fc0-4dfe-9265-0c4ebc2f7005 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:49:28.386675+00:00 |
| fe96b42f-ecbb-4bf0-8d5c-85770e37d3e5 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:49:28.310234+00:00 |