Latest run: 376d868f-1a71-41e4-9fc5-5aefd7d15b80 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T20:25:35.586459+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 376d868f-1a71-41e4-9fc5-5aefd7d15b80 | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.310 | 2026-05-23T20:25:35.586459+00:00 |
| 99cb5dd1-c119-4917-a8d3-32eb5337bf9d | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.740 | 2026-05-23T20:25:35.538977+00:00 |
| 4c8b9f0e-1304-4c61-8de2-e70e6002684d | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.310 | 2026-05-23T20:25:35.471291+00:00 |
| 353f719f-6081-44fc-8f45-06158bca210d | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.740 | 2026-05-23T20:25:35.395627+00:00 |
| acd30ba9-1ed7-43ef-91a4-13fed5b4191b | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.310 | 2026-05-23T20:25:35.327390+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 376d868f-1a71-41e4-9fc5-5aefd7d15b80 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:25:35.586459+00:00 |
| 99cb5dd1-c119-4917-a8d3-32eb5337bf9d | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:25:35.538977+00:00 |
| 4c8b9f0e-1304-4c61-8de2-e70e6002684d | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:25:35.471291+00:00 |
| 353f719f-6081-44fc-8f45-06158bca210d | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:25:35.395627+00:00 |
| acd30ba9-1ed7-43ef-91a4-13fed5b4191b | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:25:35.327390+00:00 |
| 4b0c8ed3-c437-4e16-ba9c-0a2208f16081 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:25:35.253742+00:00 |
| c028d978-cf77-47e7-acc7-15364d6787a0 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:25:35.191179+00:00 |
| a1933ace-0aaa-448a-851d-746a9fc5e928 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:25:35.126951+00:00 |