| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: ec5f2b36-9f0a-4994-ba9b-7fbe67a8884f | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-09T01:15:22.419263+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| ec5f2b36-9f0a-4994-ba9b-7fbe67a8884f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:22.419263+00:00 |
| 233df12b-120d-4032-b069-6d81d2c40d97 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:22.340184+00:00 |
| 7068a3bc-bf2d-48c9-ba6f-effac5d9b064 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:22.260909+00:00 |
| 83c323d0-9bcb-42b5-991e-5ac888018865 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:22.159131+00:00 |
| 628b3497-67de-4999-9701-dd90b2254d56 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:15:22.036637+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| ec5f2b36-9f0a-4994-ba9b-7fbe67a8884f | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:22.419263+00:00 |
| 233df12b-120d-4032-b069-6d81d2c40d97 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:22.340184+00:00 |
| 7068a3bc-bf2d-48c9-ba6f-effac5d9b064 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:22.260909+00:00 |
| 83c323d0-9bcb-42b5-991e-5ac888018865 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:22.159131+00:00 |
| 628b3497-67de-4999-9701-dd90b2254d56 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:22.036637+00:00 |
| c099b5ee-9166-4929-b7e7-147b9b4a5dde | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:21.937852+00:00 |
| 9395886d-e1bb-477a-b0b6-7a3ae45aab86 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:21.856653+00:00 |
| 95bda6a0-fda8-4976-94ae-3b4b2779ed99 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:21.687003+00:00 |
| b9b4e9ab-d0aa-4ede-8b64-faf01af41887 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:21.598941+00:00 |
| 80997bc8-812a-4a12-96b8-302bf795d991 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:15:21.476612+00:00 |