Latest run: dea382af-2d48-4038-a0aa-163b27d451d7 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T20:21:14.843520+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| dea382af-2d48-4038-a0aa-163b27d451d7 | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.310 | 2026-05-23T20:21:14.843520+00:00 |
| 69862265-e653-4dcb-b222-69a69df94815 | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.740 | 2026-05-23T20:21:14.795318+00:00 |
| 041cdba8-c0fe-406f-b432-016a049e01e5 | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.310 | 2026-05-23T20:21:14.731878+00:00 |
| 12646b94-95ca-4887-adab-fc0af052b6d2 | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.740 | 2026-05-23T20:21:14.676318+00:00 |
| 676f22d5-cc31-444e-9d61-91febd3fb66a | coder | 3a1cb59613c43efee035337a7eb0f518754b79e1 | 0.310 | 2026-05-23T20:21:14.605149+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| dea382af-2d48-4038-a0aa-163b27d451d7 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:21:14.843520+00:00 |
| 69862265-e653-4dcb-b222-69a69df94815 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:21:14.795318+00:00 |
| 041cdba8-c0fe-406f-b432-016a049e01e5 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:21:14.731878+00:00 |
| 12646b94-95ca-4887-adab-fc0af052b6d2 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:21:14.676318+00:00 |
| 676f22d5-cc31-444e-9d61-91febd3fb66a | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:21:14.605149+00:00 |
| a82bc04e-5fd3-4a8c-af51-72a4efe2ba41 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:21:14.543442+00:00 |
| 04c7fad1-c705-4679-a3e9-715bdae66ceb | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:21:14.464621+00:00 |
| ac61e722-685f-43e0-bcb4-003a9a2cfea2 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:21:14.420909+00:00 |