| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: e913d500-d595-4206-87a3-65032bafd0b6 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-08T22:15:51.726498+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| e913d500-d595-4206-87a3-65032bafd0b6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:51.726498+00:00 |
| d4c91863-b858-4468-b8ac-f1fd11de8697 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:51.519745+00:00 |
| a0f69a30-4978-4b90-b4ee-239d93f4cf74 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:51.265142+00:00 |
| 65a3b91d-69ec-41ee-b9b2-cae372363e8e | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:51.140211+00:00 |
| 9b28db82-2229-44c6-8f8a-b82afd306494 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-08T22:15:51.013701+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| e913d500-d595-4206-87a3-65032bafd0b6 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:51.726498+00:00 |
| d4c91863-b858-4468-b8ac-f1fd11de8697 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:51.519745+00:00 |
| a0f69a30-4978-4b90-b4ee-239d93f4cf74 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:51.265142+00:00 |
| 65a3b91d-69ec-41ee-b9b2-cae372363e8e | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:51.140211+00:00 |
| 9b28db82-2229-44c6-8f8a-b82afd306494 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:51.013701+00:00 |
| 6bbbd22a-5d0f-4bd2-a7f6-d3af8ea2620d | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:50.841097+00:00 |
| b2f9d079-5d74-4038-b782-deb1bc1d6ccc | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:50.605978+00:00 |
| 947d7149-5f4a-4bfd-b513-f7d61fec29d0 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:50.464948+00:00 |
| cc13c085-7291-4a49-8ae3-a22f5bf44b80 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:50.324295+00:00 |
| 872d1a60-4225-4a9c-8716-a3758eb20285 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-08T22:15:50.198956+00:00 |