| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 665a0079-ecc0-4690-8919-9ce5a1f9f612 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-27T16:59:01.724935+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 665a0079-ecc0-4690-8919-9ce5a1f9f612 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:59:01.724935+00:00 |
| 2f9ffbc1-6c36-4310-9bd4-af8e7718adeb | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:59:01.650766+00:00 |
| babc41ec-f1de-4dba-b68d-e6ca05f77b22 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:59:01.576701+00:00 |
| 4fffd973-19d2-4c50-b097-6e225ac97358 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:59:01.505149+00:00 |
| 718ee58d-721c-4df4-af64-3ecf1efc2872 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:59:01.446363+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 665a0079-ecc0-4690-8919-9ce5a1f9f612 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.724935+00:00 |
| 2f9ffbc1-6c36-4310-9bd4-af8e7718adeb | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.650766+00:00 |
| babc41ec-f1de-4dba-b68d-e6ca05f77b22 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.576701+00:00 |
| 4fffd973-19d2-4c50-b097-6e225ac97358 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.505149+00:00 |
| 718ee58d-721c-4df4-af64-3ecf1efc2872 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.446363+00:00 |
| 4b5981a2-5973-4471-b8af-edfda51dc51f | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.374480+00:00 |
| 671dca85-d782-49ae-b97e-a9319e968c1c | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.268796+00:00 |
| 8670191a-ec89-4bb3-a1a4-32b564ffc4db | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.188281+00:00 |
| d60bcefa-381e-4569-9f01-cbf291fbb2b1 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.114358+00:00 |
| 2cd1733c-6b6c-438a-9448-c603a91049a6 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:59:01.017433+00:00 |