Latest run: 565069e9-c650-49ba-9aec-0f2ae8934b3e | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-26T10:38:51.442824+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 565069e9-c650-49ba-9aec-0f2ae8934b3e | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.310 | 2026-05-26T10:38:51.442824+00:00 |
| 675bdda5-b8ca-4e56-b9e7-3bb6af13ac77 | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.740 | 2026-05-26T10:38:51.376599+00:00 |
| 4920ea21-4d1b-4b02-8a00-aa417e2eb22f | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.310 | 2026-05-26T10:38:51.310367+00:00 |
| e80c354c-f18b-472e-980b-3add93a8f0e2 | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.740 | 2026-05-26T10:38:51.244885+00:00 |
| 884c5930-c43b-4b06-a88c-0f2d207dc7be | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.310 | 2026-05-26T10:38:51.170093+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 565069e9-c650-49ba-9aec-0f2ae8934b3e | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:38:51.442824+00:00 |
| 675bdda5-b8ca-4e56-b9e7-3bb6af13ac77 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-26T10:38:51.376599+00:00 |
| 4920ea21-4d1b-4b02-8a00-aa417e2eb22f | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:38:51.310367+00:00 |
| e80c354c-f18b-472e-980b-3add93a8f0e2 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-26T10:38:51.244885+00:00 |
| 884c5930-c43b-4b06-a88c-0f2d207dc7be | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:38:51.170093+00:00 |
| 64228873-ebd3-4d13-88d5-26791eaff50c | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:38:51.104096+00:00 |
| 6c8d2659-625b-470c-9310-4aacdf40a057 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:38:51.037977+00:00 |
| d98dbd7c-20d7-4baa-bae6-9bd9426a80f5 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-26T10:38:50.964244+00:00 |