| Task ID | Band | Score | Passed | Cost |
|---|
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: b1aa1fa3-578a-4644-ad0c-01b96b904848 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-27T16:05:41.615313+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| b1aa1fa3-578a-4644-ad0c-01b96b904848 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:41.615313+00:00 |
| 492c9f1e-0e57-4094-af06-4be5a8f11daa | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:41.550343+00:00 |
| 487c51aa-7d35-4d0a-bf55-033a4e43a014 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:41.519559+00:00 |
| f898f6e3-9514-4fd3-b22d-cd9f507915a6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:41.469300+00:00 |
| eb83e7a7-2d70-4fdc-9de9-7307f22e5df4 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:41.422240+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| b1aa1fa3-578a-4644-ad0c-01b96b904848 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.615313+00:00 |
| 492c9f1e-0e57-4094-af06-4be5a8f11daa | typescript-explain-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.550343+00:00 |
| 487c51aa-7d35-4d0a-bf55-033a4e43a014 | typescript-dependency-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.519559+00:00 |
| f898f6e3-9514-4fd3-b22d-cd9f507915a6 | typescript-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.469300+00:00 |
| eb83e7a7-2d70-4fdc-9de9-7307f22e5df4 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.422240+00:00 |
| 250665af-4f7b-41d7-9ab8-f06146a0cbd0 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.376636+00:00 |
| 3f837978-0397-431c-91cd-014e8e82bf93 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.354868+00:00 |
| d8908e04-54e9-4f32-9408-dfab86f8e883 | python-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.320671+00:00 |
| 54ca06b6-326b-400a-bbc3-56e55e585a51 | python-recovery-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.276053+00:00 |
| f7edaf6b-d942-41bc-8276-e44521e534ba | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:41.243064+00:00 |