Latest run: fe314120-d6ac-49c9-884c-98c2e7b50bb6 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T00:28:37.019588+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| fe314120-d6ac-49c9-884c-98c2e7b50bb6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T00:28:37.019588+00:00 |
| e82502a9-ec41-4274-81ce-08cbb667a143 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T00:28:36.993097+00:00 |
| d28ada90-1f40-452c-a004-f46ab59122eb | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T00:28:36.966227+00:00 |
| fca727cd-09c6-4299-bcfb-3ba11fd23b55 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T00:28:36.928139+00:00 |
| a14c182f-d770-4484-912b-32dc1aac683f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T00:28:36.889855+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| fe314120-d6ac-49c9-884c-98c2e7b50bb6 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T00:28:37.019588+00:00 |
| e82502a9-ec41-4274-81ce-08cbb667a143 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:36.993097+00:00 |
| d28ada90-1f40-452c-a004-f46ab59122eb | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:36.966227+00:00 |
| fca727cd-09c6-4299-bcfb-3ba11fd23b55 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T00:28:36.928139+00:00 |
| a14c182f-d770-4484-912b-32dc1aac683f | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T00:28:36.889855+00:00 |
| 9630aff0-e31a-4daa-bc13-1bacb58e8d41 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T00:28:36.863835+00:00 |
| 21dacaef-7484-4b52-9b9e-d5a24e1c6e48 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T00:28:36.835673+00:00 |
| 68e7783d-0adc-4dd1-be5e-023695383195 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:36.807544+00:00 |