Latest run: c4375643-1958-471b-b1b4-e8600a00aa9a | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-07T23:45:55.087227+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| c4375643-1958-471b-b1b4-e8600a00aa9a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:45:55.087227+00:00 |
| 0cd932de-16dd-419c-af78-747691cc7a9c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:45:54.972288+00:00 |
| 51754b02-4fae-4fe3-b08a-af1722b6c320 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:45:54.854986+00:00 |
| 54856320-dd30-4010-801f-e842148039c4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:45:54.737625+00:00 |
| e2fab2b6-bc99-4070-a48a-13446c5de6aa | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:45:54.623419+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| c4375643-1958-471b-b1b4-e8600a00aa9a | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:45:55.087227+00:00 |
| 0cd932de-16dd-419c-af78-747691cc7a9c | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:45:54.972288+00:00 |
| 51754b02-4fae-4fe3-b08a-af1722b6c320 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:45:54.854986+00:00 |
| 54856320-dd30-4010-801f-e842148039c4 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:45:54.737625+00:00 |
| e2fab2b6-bc99-4070-a48a-13446c5de6aa | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:45:54.623419+00:00 |
| c858a242-bfad-47d2-a048-253f29c8a260 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:45:54.500805+00:00 |
| 18433ec0-38e4-46da-a2a8-051bb7410039 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:45:54.388897+00:00 |
| 3ca36617-e908-4ac4-98e9-e8646b847860 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:45:54.273842+00:00 |