Latest run: fa850194-780d-454f-a955-d7d71f6d4fad | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T18:06:35.044879+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| fa850194-780d-454f-a955-d7d71f6d4fad | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:06:35.044879+00:00 |
| ba006822-b7fd-49fc-bc36-680592a6b335 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:06:34.768729+00:00 |
| 4e3264fb-e7b7-416e-ad88-76d8a339237a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:06:34.577604+00:00 |
| 6a89093d-88fe-4120-a708-6d1a0b39c364 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-23T18:06:34.422906+00:00 |
| b449b9e2-5ff8-4b38-9912-81d281fd12b5 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-23T18:06:34.260784+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| fa850194-780d-454f-a955-d7d71f6d4fad | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:06:35.044879+00:00 |
| ba006822-b7fd-49fc-bc36-680592a6b335 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:06:34.768729+00:00 |
| 4e3264fb-e7b7-416e-ad88-76d8a339237a | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:06:34.577604+00:00 |
| 6a89093d-88fe-4120-a708-6d1a0b39c364 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:06:34.422906+00:00 |
| b449b9e2-5ff8-4b38-9912-81d281fd12b5 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:06:34.260784+00:00 |
| 5c5d622a-a8b4-450a-97ce-b51fea38e9e2 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:06:34.013373+00:00 |
| 5196e312-c2c7-427e-a246-ff867e5a7dba | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T18:06:33.854221+00:00 |
| 29b2c5e2-a2c9-4389-a0df-9e25e3812821 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T18:06:33.740506+00:00 |