| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 88b86484-3953-4fb0-8dcd-01132b9d33d9 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-07T23:30:11.326922+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 88b86484-3953-4fb0-8dcd-01132b9d33d9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:11.326922+00:00 |
| 279b7aef-97e8-4345-b45e-9658ec333065 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:11.244499+00:00 |
| 695d4edb-fe59-4cc7-8c8e-941ed6ccc91d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:11.152576+00:00 |
| 871fe6ab-4791-4184-9a20-dde675b6d9d5 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:11.051834+00:00 |
| 8d6dc710-825f-4abf-8af7-efd6acc57942 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:10.973331+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 88b86484-3953-4fb0-8dcd-01132b9d33d9 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:11.326922+00:00 |
| 279b7aef-97e8-4345-b45e-9658ec333065 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:11.244499+00:00 |
| 695d4edb-fe59-4cc7-8c8e-941ed6ccc91d | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:11.152576+00:00 |
| 871fe6ab-4791-4184-9a20-dde675b6d9d5 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:11.051834+00:00 |
| 8d6dc710-825f-4abf-8af7-efd6acc57942 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:10.973331+00:00 |
| b09db729-273e-4c98-8541-a1543bb2f98e | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:10.888188+00:00 |
| e276e0b2-421a-40c3-b0d1-f8eecc8ef186 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:10.817910+00:00 |
| 7cdc3302-109b-4bb3-81dd-cbd0a6a09e34 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:10.722939+00:00 |
| 96071988-2dec-478e-b3e1-58e574e2d8d8 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:10.629244+00:00 |
| 3dabdfa7-1895-41ef-8c3a-9cc0d98eb5a2 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:10.521071+00:00 |