Latest run: 4344340d-d46a-4dfe-9c22-cce6a5edb821 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T17:55:53.124520+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 4344340d-d46a-4dfe-9c22-cce6a5edb821 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T17:55:53.124520+00:00 |
| d8659e86-fdc0-490a-8146-c89e49b74dfe | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T17:55:53.045529+00:00 |
| 7152361a-411e-4777-9e86-12a2c0d4e5c4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T17:55:52.966544+00:00 |
| a34c10d4-d6e9-4ad5-a0db-06050938350d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T17:55:52.893281+00:00 |
| c32f7333-d4c7-4910-ad29-017e355e8285 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T17:55:52.822943+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 4344340d-d46a-4dfe-9c22-cce6a5edb821 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T17:55:53.124520+00:00 |
| d8659e86-fdc0-490a-8146-c89e49b74dfe | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T17:55:53.045529+00:00 |
| 7152361a-411e-4777-9e86-12a2c0d4e5c4 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T17:55:52.966544+00:00 |
| a34c10d4-d6e9-4ad5-a0db-06050938350d | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T17:55:52.893281+00:00 |
| c32f7333-d4c7-4910-ad29-017e355e8285 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T17:55:52.822943+00:00 |
| 17061423-64e1-4d55-9b32-15add822d9cc | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T17:55:52.731699+00:00 |
| fe437d2b-c21e-4707-9f77-ef86b4932b31 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T17:55:52.634634+00:00 |
| c71e0b2b-ebf7-4bd1-9b73-ffb952853b38 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T17:55:52.562114+00:00 |