Latest run: ee409b1f-1b7c-4352-88a9-cf7153332ccd | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T19:10:58.581515+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| ee409b1f-1b7c-4352-88a9-cf7153332ccd | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:10:58.581515+00:00 |
| 699d1368-abf2-424f-aff9-3ea444e8605d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T19:10:58.500454+00:00 |
| 4695a4b1-85ac-491b-8e5b-7a5952105bd8 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:10:58.415003+00:00 |
| 80db3b7b-13b8-4de4-ab5f-2bb5504a347e | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T19:10:58.296139+00:00 |
| b6c39354-b509-4dcb-a524-c47caf202f83 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:10:58.199832+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| ee409b1f-1b7c-4352-88a9-cf7153332ccd | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:10:58.581515+00:00 |
| 699d1368-abf2-424f-aff9-3ea444e8605d | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:10:58.500454+00:00 |
| 4695a4b1-85ac-491b-8e5b-7a5952105bd8 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:10:58.415003+00:00 |
| 80db3b7b-13b8-4de4-ab5f-2bb5504a347e | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:10:58.296139+00:00 |
| b6c39354-b509-4dcb-a524-c47caf202f83 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:10:58.199832+00:00 |
| 01f5ee5e-7827-45b2-b043-764b11418960 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:10:58.096583+00:00 |
| 1ae84aa8-70d3-482b-94f1-cd417d162c5c | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:10:58.016944+00:00 |
| e8df49ad-8137-4739-88e3-63f2c2f2b171 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:10:57.938131+00:00 |