Latest run: 5ea93940-5bce-4dc1-87b5-289dd1fb4af1 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T18:01:40.559847+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 5ea93940-5bce-4dc1-87b5-289dd1fb4af1 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:01:40.559847+00:00 |
| 68c7d327-3b3c-43aa-a337-6ba0f5c5c1a2 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:01:40.501870+00:00 |
| a56785b9-37b0-4027-8b07-d4821940c8be | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:01:40.443314+00:00 |
| 57386936-6600-4688-b54a-0e0552d2181d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:01:40.377027+00:00 |
| da4f08bb-e19f-4f8c-8eac-2dbae10f1228 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:01:40.310490+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 5ea93940-5bce-4dc1-87b5-289dd1fb4af1 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:01:40.559847+00:00 |
| 68c7d327-3b3c-43aa-a337-6ba0f5c5c1a2 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:01:40.501870+00:00 |
| a56785b9-37b0-4027-8b07-d4821940c8be | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:01:40.443314+00:00 |
| 57386936-6600-4688-b54a-0e0552d2181d | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:01:40.377027+00:00 |
| da4f08bb-e19f-4f8c-8eac-2dbae10f1228 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:01:40.310490+00:00 |
| 6b1cf645-839b-4593-94ab-60505b139c2f | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:01:40.252176+00:00 |
| 3b011ae0-88d4-416e-8fc6-81d19dbdb2e7 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:01:40.185847+00:00 |
| 04a818c1-010a-4fb3-8952-3d358afba616 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:01:40.119119+00:00 |