Latest run: 48398cfa-d04d-469d-9708-fc24277cbfe2 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T12:01:59.266902+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 48398cfa-d04d-469d-9708-fc24277cbfe2 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T12:01:59.266902+00:00 |
| f2b4c3b3-1d3a-4ce7-b580-ba0f98cd5e39 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T12:01:59.030447+00:00 |
| a6d5792f-0109-4238-a612-ac513a317e7a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T12:01:58.680548+00:00 |
| a41406e2-4c7a-4ca2-addb-04b4c75eab9f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T12:01:58.445747+00:00 |
| 3923abb4-983f-44dd-a9d5-4c94499c9e01 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T12:01:58.204714+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 48398cfa-d04d-469d-9708-fc24277cbfe2 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T12:01:59.266902+00:00 |
| f2b4c3b3-1d3a-4ce7-b580-ba0f98cd5e39 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T12:01:59.030447+00:00 |
| a6d5792f-0109-4238-a612-ac513a317e7a | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T12:01:58.680548+00:00 |
| a41406e2-4c7a-4ca2-addb-04b4c75eab9f | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T12:01:58.445747+00:00 |
| 3923abb4-983f-44dd-a9d5-4c94499c9e01 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T12:01:58.204714+00:00 |
| af195889-b003-4b5c-8304-d695685fddc9 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T12:01:57.910680+00:00 |
| e1a0aba9-c506-4bfe-a8e4-634bc2542183 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T12:01:57.691760+00:00 |
| 438af269-d917-4649-a1b6-d54da337ff90 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T12:01:57.464716+00:00 |