Latest run: 8b828525-eef8-4628-b7ea-94e481545650 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-07T23:30:52.143283+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 8b828525-eef8-4628-b7ea-94e481545650 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:30:52.143283+00:00 |
| 4758feaa-1440-41aa-b1cd-5e7dddbc5e62 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:52.078330+00:00 |
| 9929f942-8406-4c1f-9790-14854da8deb4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:51.990717+00:00 |
| cfa224d7-0457-4b29-8372-c10209b40e7a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:30:51.907451+00:00 |
| 20bfc80e-5731-46b6-97d0-c461b6e162b9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-07T23:30:51.821993+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 8b828525-eef8-4628-b7ea-94e481545650 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:30:52.143283+00:00 |
| 4758feaa-1440-41aa-b1cd-5e7dddbc5e62 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:52.078330+00:00 |
| 9929f942-8406-4c1f-9790-14854da8deb4 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:51.990717+00:00 |
| cfa224d7-0457-4b29-8372-c10209b40e7a | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:30:51.907451+00:00 |
| 20bfc80e-5731-46b6-97d0-c461b6e162b9 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:30:51.821993+00:00 |
| a3fa178f-4b53-4474-9742-8a57308a22ef | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:30:51.733316+00:00 |
| 69950ac3-9a45-4a72-902b-ea57eebbab86 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-07T23:30:51.656626+00:00 |
| c912961a-db1a-4249-890b-0f545886aea1 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:51.582070+00:00 |