Latest run: 727c11ed-323b-4838-ad11-6cac11fda0e6 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-04-28T00:24:46.426715+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 727c11ed-323b-4838-ad11-6cac11fda0e6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:24:46.426715+00:00 |
| a6e37640-a2a7-40c7-8b1f-53501bd6407e | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:24:46.314690+00:00 |
| f8ae5b0e-a54a-4ab0-961b-5fb24727c980 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:24:46.240403+00:00 |
| 6a5c8f1e-c092-41f4-8a9d-ce03cf2feb83 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:24:46.159163+00:00 |
| f49b945b-1536-4ac6-95ec-96f8c57fd035 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-04-28T00:24:46.067852+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 727c11ed-323b-4838-ad11-6cac11fda0e6 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:24:46.426715+00:00 |
| a6e37640-a2a7-40c7-8b1f-53501bd6407e | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:24:46.314690+00:00 |
| f8ae5b0e-a54a-4ab0-961b-5fb24727c980 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:24:46.240403+00:00 |
| 6a5c8f1e-c092-41f4-8a9d-ce03cf2feb83 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:24:46.159163+00:00 |
| f49b945b-1536-4ac6-95ec-96f8c57fd035 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:24:46.067852+00:00 |
| e60de365-b62e-47e9-a95b-764fedfb99dd | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:24:45.974213+00:00 |
| 67652736-f17c-4d7f-a874-ec3d7a3cc980 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-04-28T00:24:45.889484+00:00 |
| f1c055d3-b7b2-4068-8ad3-4451d13bc9a5 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:24:45.823009+00:00 |