| Task ID | Band | Score | Passed | Cost |
|---|---|---|---|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |

Latest run: 79c1f929-dda3-4d3b-8807-8155906852a5 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-27T16:26:17.282152+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 79c1f929-dda3-4d3b-8807-8155906852a5 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:26:17.282152+00:00 |
| f6ffa11d-be70-4073-8503-38bad4e7c576 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:26:17.213585+00:00 |
| 801bac5f-9774-4319-bf58-e643ce2cd7a9 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:26:17.084838+00:00 |
| a6f57e43-05fd-4a32-b06a-e2af45c0cbd3 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:26:16.976236+00:00 |
| 47c6f14a-6f59-4340-8cef-ce8f2a0a430c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:26:16.828889+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 79c1f929-dda3-4d3b-8807-8155906852a5 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:17.282152+00:00 |
| f6ffa11d-be70-4073-8503-38bad4e7c576 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:17.213585+00:00 |
| 801bac5f-9774-4319-bf58-e643ce2cd7a9 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:17.084838+00:00 |
| a6f57e43-05fd-4a32-b06a-e2af45c0cbd3 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:16.976236+00:00 |
| 47c6f14a-6f59-4340-8cef-ce8f2a0a430c | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:16.828889+00:00 |
| b31f8efc-46eb-4239-8bac-18b21b2ab650 | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:16.441985+00:00 |
| 166d7675-8445-42d3-b214-584245167db3 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:16.311928+00:00 |
| 00462b69-2240-4a90-abb9-7e2277397412 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:16.244545+00:00 |
| 8008ff61-1313-454c-8ed9-3496456385d1 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:16.176070+00:00 |
| 8633984f-f23f-44ae-ba78-de9530d5a412 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:26:16.107508+00:00 |