Latest run: 2f351b4c-b4b9-4cf3-a448-4ea789ac95b5 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-04-28T00:09:52.015876+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 2f351b4c-b4b9-4cf3-a448-4ea789ac95b5 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:09:52.015876+00:00 |
| e9888987-afa0-40a6-a525-f20cfe4e4d37 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:09:51.950399+00:00 |
| 2803ef28-7139-470a-933e-ffe182874e95 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:09:51.887196+00:00 |
| a8fb8985-c6db-4a1c-89bc-ce08cecf7fbb | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:09:51.826448+00:00 |
| 1c6baf50-499f-452b-814f-7fa48843cf86 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:09:51.760383+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 2f351b4c-b4b9-4cf3-a448-4ea789ac95b5 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:09:52.015876+00:00 |
| e9888987-afa0-40a6-a525-f20cfe4e4d37 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:09:51.950399+00:00 |
| 2803ef28-7139-470a-933e-ffe182874e95 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:09:51.887196+00:00 |
| a8fb8985-c6db-4a1c-89bc-ce08cecf7fbb | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:09:51.826448+00:00 |
| 1c6baf50-499f-452b-814f-7fa48843cf86 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:09:51.760383+00:00 |
| f8153e58-6037-432c-9633-c0c3e84a317e | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:09:51.707125+00:00 |
| b4746f76-5b3f-41ac-8693-9c7a361b8588 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:09:51.643088+00:00 |
| 3bc3502f-e572-4cd3-921f-1f3112f4d393 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:09:51.587855+00:00 |