Latest run: 6e32ab5f-749c-46b1-b3e4-3cdc655b50a2 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-08T22:25:45.660382+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 6e32ab5f-749c-46b1-b3e4-3cdc655b50a2 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:25:45.660382+00:00 |
| 210e679c-1807-4e75-985a-4c03dcaf5b09 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:25:45.388206+00:00 |
| 8b674494-515c-4ac9-bb41-39765d60cb27 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:25:44.522684+00:00 |
| 78cfa06d-8996-425a-8f38-63f793bf2c41 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:25:44.297852+00:00 |
| 689f1e89-ed5c-4cf8-a516-d452ee75a183 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:25:43.955085+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 6e32ab5f-749c-46b1-b3e4-3cdc655b50a2 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:45.660382+00:00 |
| 210e679c-1807-4e75-985a-4c03dcaf5b09 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:45.388206+00:00 |
| 8b674494-515c-4ac9-bb41-39765d60cb27 | python-recovery-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:44.522684+00:00 |
| 78cfa06d-8996-425a-8f38-63f793bf2c41 | typescript-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:44.297852+00:00 |
| 689f1e89-ed5c-4cf8-a516-d452ee75a183 | python-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:43.955085+00:00 |
| 15bf7cb9-4880-4989-bb4b-7cff4c53650a | typescript-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:43.762228+00:00 |
| cedd125a-cda8-4808-ad18-034871a96c7b | python-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:43.527826+00:00 |
| 679b317b-e482-498a-a15b-d282fbf1c5e8 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:43.301946+00:00 |
| 11b29a6a-8b1a-4078-b888-a8efab5b357f | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:43.125859+00:00 |
| b0389100-32da-4ec6-b633-f5c776517099 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:25:42.898059+00:00 |