| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 5ee40fc3-953f-4290-a3a2-42c45fffdb97 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-09T00:28:17.590182+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 5ee40fc3-953f-4290-a3a2-42c45fffdb97 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T00:28:17.590182+00:00 |
| ab6a1793-f881-405e-8c24-a16316f05a7d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T00:28:17.559126+00:00 |
| 330f8f02-d3df-4cc2-a841-5f0d7655f91c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T00:28:17.521265+00:00 |
| a9239e4f-50b3-4b86-a5aa-75550682bf80 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T00:28:17.485003+00:00 |
| 4386029c-a75a-478c-b23f-528df7976114 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T00:28:17.446275+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 5ee40fc3-953f-4290-a3a2-42c45fffdb97 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.590182+00:00 |
| ab6a1793-f881-405e-8c24-a16316f05a7d | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.559126+00:00 |
| 330f8f02-d3df-4cc2-a841-5f0d7655f91c | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.521265+00:00 |
| a9239e4f-50b3-4b86-a5aa-75550682bf80 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.485003+00:00 |
| 4386029c-a75a-478c-b23f-528df7976114 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.446275+00:00 |
| 9769d3ad-c8e1-4c0f-be48-0ddfc9eac517 | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.414662+00:00 |
| f5f81ba5-278a-4f4a-b912-4871ebb8d919 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.379231+00:00 |
| 81db5bdd-aeb9-4b3a-a8c7-bd793637f897 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.343197+00:00 |
| cb5af594-0163-40b7-ad4d-f96490c267c4 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.293334+00:00 |
| 764c025a-dd32-4a1e-8089-3744e1204abc | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T00:28:17.269443+00:00 |