| Task ID | Band | Score | Passed | Cost |
|---|
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 6209fc06-6dcc-434b-bf94-775356a2fc64 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-27T16:05:44.789461+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 6209fc06-6dcc-434b-bf94-775356a2fc64 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:44.789461+00:00 |
| abf48a9b-4db2-47c5-b65c-5c2c5e681fa3 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:44.749019+00:00 |
| 5a7afe15-8aa1-4e97-abad-fa609c91017d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:44.712390+00:00 |
| a684150b-c575-4cff-8529-eeb3f473fdcb | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:44.674436+00:00 |
| 2c1b5792-aaef-4fb9-b83e-cebfb43d5730 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:44.632844+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 6209fc06-6dcc-434b-bf94-775356a2fc64 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.789461+00:00 |
| abf48a9b-4db2-47c5-b65c-5c2c5e681fa3 | typescript-explain-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.749019+00:00 |
| 5a7afe15-8aa1-4e97-abad-fa609c91017d | typescript-dependency-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.712390+00:00 |
| a684150b-c575-4cff-8529-eeb3f473fdcb | typescript-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.674436+00:00 |
| 2c1b5792-aaef-4fb9-b83e-cebfb43d5730 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.632844+00:00 |
| 8f4b1fae-f3e5-4655-9b71-77898e9102c6 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.609658+00:00 |
| ae3a73d2-5b65-4a5c-9fbe-8f38165897f9 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.580193+00:00 |
| 48686b39-226f-4131-9efc-de02c2f0451a | python-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.546089+00:00 |
| 7798175b-63f1-4e2c-ac1c-9588cf2c4828 | python-recovery-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.516886+00:00 |
| aa226740-6e79-4229-aa32-e059fb1bd1ef | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:44.476851+00:00 |