| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 7fd7a70a-6094-4a4c-b1af-a98ea7f23d55 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-09T01:15:13.593964+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 7fd7a70a-6094-4a4c-b1af-a98ea7f23d55 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:13.593964+00:00 |
| 4ee0a91f-93b6-4344-ad6a-a86606b7eaca | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:13.485252+00:00 |
| bac98d64-02c9-45ed-a8c7-c4b32a5d513b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:13.384609+00:00 |
| ee1328c0-1bf4-4898-938f-0c2c82d05c58 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:13.294205+00:00 |
| 9c1af951-3c30-4963-96e9-8b01db43c62f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:13.206306+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 7fd7a70a-6094-4a4c-b1af-a98ea7f23d55 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:13.593964+00:00 |
| 4ee0a91f-93b6-4344-ad6a-a86606b7eaca | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:13.485252+00:00 |
| bac98d64-02c9-45ed-a8c7-c4b32a5d513b | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:13.384609+00:00 |
| ee1328c0-1bf4-4898-938f-0c2c82d05c58 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:13.294205+00:00 |
| 9c1af951-3c30-4963-96e9-8b01db43c62f | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:13.206306+00:00 |
| 5a677b0f-20a6-4385-8345-2219a95641e7 | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:13.088996+00:00 |
| afd3c175-d8d4-48e0-be27-f940abbba8a8 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:12.988797+00:00 |
| 2e62d632-8af9-4a6f-b5cc-44f062942883 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:12.886073+00:00 |
| 14790832-3929-47c1-aa47-ba19c837c6bc | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:12.769659+00:00 |
| cd84ceb2-7953-463b-9b0a-bf3bb6eacaeb | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:12.682844+00:00 |