Latest run: b0a09138-da21-47ec-b0c8-e41ede4a1d47 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-22T14:02:00.091149+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| b0a09138-da21-47ec-b0c8-e41ede4a1d47 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T14:02:00.091149+00:00 |
| 1aea8745-454d-4599-bce0-80ef1b69968b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-22T14:01:59.971081+00:00 |
| 08556792-4126-490c-9646-20ec2618bbe8 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T14:01:59.871810+00:00 |
| 3a8146aa-b45b-4093-9231-b77908e89eb0 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-22T14:01:59.759066+00:00 |
| 9cbed913-e610-4a02-930a-a31478bcd8f9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T14:01:59.656905+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| b0a09138-da21-47ec-b0c8-e41ede4a1d47 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-22T14:02:00.091149+00:00 |
| 1aea8745-454d-4599-bce0-80ef1b69968b | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T14:01:59.971081+00:00 |
| 08556792-4126-490c-9646-20ec2618bbe8 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-22T14:01:59.871810+00:00 |
| 3a8146aa-b45b-4093-9231-b77908e89eb0 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T14:01:59.759066+00:00 |
| 9cbed913-e610-4a02-930a-a31478bcd8f9 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-22T14:01:59.656905+00:00 |
| ea6687ff-9c8b-4ab4-a893-ec5220d81ec8 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-22T14:01:59.549998+00:00 |
| ae9d126e-086b-477a-8be2-65f9f5881aab | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-22T14:01:59.439216+00:00 |
| e7ec4ef8-5321-4191-a8e4-2c53dfa56259 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T14:01:59.334293+00:00 |