| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 869c4579-9d85-4947-81a1-38d061607bfa | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-08T22:15:40.646943+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 869c4579-9d85-4947-81a1-38d061607bfa | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:40.646943+00:00 |
| 3da3e0c6-ab1a-48bf-a3d7-4ce116c5d06a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:40.573576+00:00 |
| 3072b112-5d7d-4891-8c11-f920d75044ef | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:40.478923+00:00 |
| 5ee45116-dc9d-4223-8d20-a3909ae601dd | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:40.413275+00:00 |
| 9cd157b1-9481-428b-b93b-bcd32e93dd96 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:40.324103+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 869c4579-9d85-4947-81a1-38d061607bfa | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.646943+00:00 |
| 3da3e0c6-ab1a-48bf-a3d7-4ce116c5d06a | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.573576+00:00 |
| 3072b112-5d7d-4891-8c11-f920d75044ef | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.478923+00:00 |
| 5ee45116-dc9d-4223-8d20-a3909ae601dd | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.413275+00:00 |
| 9cd157b1-9481-428b-b93b-bcd32e93dd96 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.324103+00:00 |
| 2108c834-9d38-48e1-85a5-0ffbba29430f | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.237741+00:00 |
| 1ea2823d-e29b-4fd4-b94d-1f189d759382 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.169861+00:00 |
| 9c00ff15-6059-402d-af0b-c9a7f4ff72d7 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.100725+00:00 |
| 47839d03-0b0f-4532-83c6-28fe54235460 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:40.017320+00:00 |
| 67d6fe0f-3428-4ed9-85e2-92b90e5b22a0 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:39.935096+00:00 |