| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 729f08bd-363f-4389-a64c-221780ac4f50 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-28T00:06:00.156327+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 729f08bd-363f-4389-a64c-221780ac4f50 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:06:00.156327+00:00 |
| 5dcb592c-76c1-4b3e-a1e3-9687f87b20d7 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:06:00.097171+00:00 |
| dc7e4b67-96c4-40b6-838f-2ab305cf8ad3 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:06:00.035072+00:00 |
| fedd8b01-a268-4c35-9eec-ddf54eb9eeea | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:05:59.972708+00:00 |
| 022d05e3-ca81-406a-907d-69ff61b5b103 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:05:59.911699+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 729f08bd-363f-4389-a64c-221780ac4f50 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:06:00.156327+00:00 |
| 5dcb592c-76c1-4b3e-a1e3-9687f87b20d7 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:06:00.097171+00:00 |
| dc7e4b67-96c4-40b6-838f-2ab305cf8ad3 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:06:00.035072+00:00 |
| fedd8b01-a268-4c35-9eec-ddf54eb9eeea | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:05:59.972708+00:00 |
| 022d05e3-ca81-406a-907d-69ff61b5b103 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:05:59.911699+00:00 |
| ef7c2f8e-4d10-4fc2-862b-d853c4700f3d | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:05:59.846632+00:00 |
| c35e3342-94cb-48bf-83d3-237e749701c3 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:05:59.790215+00:00 |
| 34fe05df-0543-4a73-a06a-60303d536a6a | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:05:59.731397+00:00 |
| d81e9fd7-01c0-476d-b3d2-00be37ea453e | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:05:59.670756+00:00 |
| 085091fd-0267-40dc-a971-37bae7c11c82 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:05:59.607265+00:00 |