Latest run: ddf9a5c9-8eaa-4ecb-9859-86da5a9e35e2 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-08T22:27:10.512731+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| ddf9a5c9-8eaa-4ecb-9859-86da5a9e35e2 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T22:27:10.512731+00:00 |
| f691fc0c-4e5d-43c4-960d-e6c0b7e9abf5 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:27:10.382696+00:00 |
| 100a9023-6f44-45ba-868c-a48a0d406f49 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:27:10.250733+00:00 |
| c747e69d-b9f1-4749-8cd5-832f7b206efa | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T22:27:10.112088+00:00 |
| c0ce9794-e8ba-473e-ae01-a973fcf31d09 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-08T22:27:09.943053+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| ddf9a5c9-8eaa-4ecb-9859-86da5a9e35e2 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-08T22:27:10.512731+00:00 |
| f691fc0c-4e5d-43c4-960d-e6c0b7e9abf5 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:27:10.382696+00:00 |
| 100a9023-6f44-45ba-868c-a48a0d406f49 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:27:10.250733+00:00 |
| c747e69d-b9f1-4749-8cd5-832f7b206efa | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-08T22:27:10.112088+00:00 |
| c0ce9794-e8ba-473e-ae01-a973fcf31d09 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-08T22:27:09.943053+00:00 |
| 755b48f9-ff39-4b00-892a-a0d795340a75 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-08T22:27:09.834060+00:00 |
| 154c0499-e637-4605-b476-e04f16be197f | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-08T22:27:09.721610+00:00 |
| 37500f75-748e-44c2-b077-f1b3b4a91c35 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:27:09.566211+00:00 |