| Task ID | Band | Score | Passed | Cost |
|---|---|---|---|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |

Latest run: 84d0cd35-70d4-4107-b4d1-8efdbcec0e68 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-08T21:29:43.007324+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 84d0cd35-70d4-4107-b4d1-8efdbcec0e68 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T21:29:43.007324+00:00 |
| c54fdd2f-fee0-486b-bc2a-604601757018 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T21:29:42.922198+00:00 |
| d11d45fd-6cfe-4300-ae00-0e0a9fa10065 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T21:29:42.766460+00:00 |
| bf104225-18f3-425f-b0b9-38f994a26f0f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T21:29:42.614360+00:00 |
| 2a09143f-c705-43aa-8a7b-0dd1a9fae937 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T21:29:42.470055+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 84d0cd35-70d4-4107-b4d1-8efdbcec0e68 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:43.007324+00:00 |
| c54fdd2f-fee0-486b-bc2a-604601757018 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.922198+00:00 |
| d11d45fd-6cfe-4300-ae00-0e0a9fa10065 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.766460+00:00 |
| bf104225-18f3-425f-b0b9-38f994a26f0f | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.614360+00:00 |
| 2a09143f-c705-43aa-8a7b-0dd1a9fae937 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.470055+00:00 |
| 5e120828-1c08-4efd-9350-1b3b05747a46 | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.367288+00:00 |
| e2fc1128-782e-4428-b519-1b761ade4ae2 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.259512+00:00 |
| 7167de6a-bb1d-4207-99c4-ff44eccfa4e1 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.188349+00:00 |
| 9de9099a-2994-4fe5-aee5-7fcee841ece3 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.115128+00:00 |
| f4eedf8a-b683-452c-a168-d89a0365d876 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T21:29:42.023060+00:00 |