Latest run: d9167614-00ed-402b-a1d4-091a254d7709 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T19:15:12.407786+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| d9167614-00ed-402b-a1d4-091a254d7709 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:15:12.407786+00:00 |
| 852f6c22-6abd-4d23-9405-79d1b59bc441 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T19:15:12.328400+00:00 |
| 0d135d4e-dd8e-49d4-8d9a-667f9aa66ee5 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:15:12.282250+00:00 |
| d96d24e7-e176-4e2d-bfe9-943a2f98b4d3 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T19:15:12.200621+00:00 |
| fb732359-e30f-4a61-90b9-95e894e9756a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T19:15:12.136832+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| d9167614-00ed-402b-a1d4-091a254d7709 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:15:12.407786+00:00 |
| 852f6c22-6abd-4d23-9405-79d1b59bc441 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:15:12.328400+00:00 |
| 0d135d4e-dd8e-49d4-8d9a-667f9aa66ee5 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:15:12.282250+00:00 |
| d96d24e7-e176-4e2d-bfe9-943a2f98b4d3 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:15:12.200621+00:00 |
| fb732359-e30f-4a61-90b9-95e894e9756a | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:15:12.136832+00:00 |
| 3f27eb0f-c86a-47e4-9219-7cbe7ea7cc30 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:15:12.080069+00:00 |
| 9208aef1-dc95-4eb4-8b01-9eea44ffc19e | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T19:15:12.009601+00:00 |
| 3555dfdf-c1b3-4438-9926-0919866206b8 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T19:15:11.946271+00:00 |