| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 03902f7b-992d-46b1-9ca8-2e2fd3bcfe83 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-07T23:10:25.184450+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 03902f7b-992d-46b1-9ca8-2e2fd3bcfe83 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:10:25.184450+00:00 |
| 15a79eae-b845-479e-a139-7b95ffe9c8e0 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:10:25.065198+00:00 |
| d472f365-46aa-4996-85ad-1773ff2de407 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:10:24.961718+00:00 |
| bae7a87c-d953-4e3d-a4cc-b501f14d3945 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:10:24.858234+00:00 |
| 4a1a1641-7291-4c48-b236-ed3f64fe28d9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:10:24.754481+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 03902f7b-992d-46b1-9ca8-2e2fd3bcfe83 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:25.184450+00:00 |
| 15a79eae-b845-479e-a139-7b95ffe9c8e0 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:25.065198+00:00 |
| d472f365-46aa-4996-85ad-1773ff2de407 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:24.961718+00:00 |
| bae7a87c-d953-4e3d-a4cc-b501f14d3945 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:24.858234+00:00 |
| 4a1a1641-7291-4c48-b236-ed3f64fe28d9 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:24.754481+00:00 |
| d2a7067b-3a0f-4cbf-8077-7a9960316139 | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:24.649202+00:00 |
| eb993dbe-3894-4d5d-bb03-b3143a0b33ca | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:24.546346+00:00 |
| 21f3ec0e-7d0a-45b7-b317-cf1004ee0593 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:24.432185+00:00 |
| 58f2253d-55d8-463a-9c52-6d70f190b02f | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:24.322749+00:00 |
| 24de33a4-4e3d-4ac1-b53f-d2e87e86f42e | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:10:24.211213+00:00 |