Latest run: a0291428-794e-4ccc-b1e8-508808b682a9 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T18:57:10.324681+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| a0291428-794e-4ccc-b1e8-508808b682a9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:57:10.324681+00:00 |
| 936f4629-86b4-4529-b924-d90fede38b4d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:57:10.243258+00:00 |
| 058ff750-91ee-4d47-9fb2-b3b81df6cfe4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:57:10.180202+00:00 |
| 98f401b3-7341-4182-83aa-3a7cca0e9cc2 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:57:10.109097+00:00 |
| d03198cb-156a-4b1a-a029-0a1095abbdb4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:57:10.021813+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| a0291428-794e-4ccc-b1e8-508808b682a9 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:57:10.324681+00:00 |
| 936f4629-86b4-4529-b924-d90fede38b4d | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:57:10.243258+00:00 |
| 058ff750-91ee-4d47-9fb2-b3b81df6cfe4 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:57:10.180202+00:00 |
| 98f401b3-7341-4182-83aa-3a7cca0e9cc2 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:57:10.109097+00:00 |
| d03198cb-156a-4b1a-a029-0a1095abbdb4 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:57:10.021813+00:00 |
| d9fb4c37-cbdc-4dc7-b420-98df5f6c9d30 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:57:09.942168+00:00 |
| 5a4c056f-733d-487b-9451-5a53518d2627 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:57:09.878455+00:00 |
| fdb484ff-b5c0-40f0-9869-e6fa3bf5d412 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:57:09.814986+00:00 |