Latest run: c49edd40-e5a6-48dd-8c5e-a5ffb499770c | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-22T13:42:19.037532+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| c49edd40-e5a6-48dd-8c5e-a5ffb499770c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T13:42:19.037532+00:00 |
| bc6f9ef8-83b2-4aa8-adb0-2715c30f7504 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-22T13:42:18.939432+00:00 |
| d2bd596a-f5ab-4c62-8efb-2960ebdb3a2d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T13:42:18.798881+00:00 |
| 90e70ff6-3af4-4f8f-8465-484e721be17f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-22T13:42:18.708153+00:00 |
| c861b0e1-93ac-47de-b7e9-34a8a43697a3 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-22T13:42:18.620248+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| c49edd40-e5a6-48dd-8c5e-a5ffb499770c | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:42:19.037532+00:00 |
| bc6f9ef8-83b2-4aa8-adb0-2715c30f7504 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T13:42:18.939432+00:00 |
| d2bd596a-f5ab-4c62-8efb-2960ebdb3a2d | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:42:18.798881+00:00 |
| 90e70ff6-3af4-4f8f-8465-484e721be17f | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T13:42:18.708153+00:00 |
| c861b0e1-93ac-47de-b7e9-34a8a43697a3 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:42:18.620248+00:00 |
| 1cb638d7-6150-4d94-8036-f87321f34f58 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:42:18.519055+00:00 |
| 4d9011a6-6801-4e50-8312-7cc1ca6423cc | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-22T13:42:18.423862+00:00 |
| 9ca5e17b-b041-49a4-925d-7810dde7fdb6 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-22T13:42:18.320464+00:00 |