| Task ID | Band | Score | Passed | Cost |
|---|
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 4c8a6ff3-fa85-46a2-9512-37cd2355331d | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-27T16:05:46.872704+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 4c8a6ff3-fa85-46a2-9512-37cd2355331d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:46.872704+00:00 |
| 11741f67-c8f4-4140-a287-11adb8ae5504 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:46.828636+00:00 |
| 56982985-032e-4069-a517-5a5780f99cbe | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:46.764011+00:00 |
| ddf78767-3583-4f6e-a5f0-6e669758958f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:46.732801+00:00 |
| 34870ade-8efa-4abf-af5d-72282dc60c13 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:46.699810+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 4c8a6ff3-fa85-46a2-9512-37cd2355331d | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.872704+00:00 |
| 11741f67-c8f4-4140-a287-11adb8ae5504 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.828636+00:00 |
| 56982985-032e-4069-a517-5a5780f99cbe | python-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.764011+00:00 |
| ddf78767-3583-4f6e-a5f0-6e669758958f | python-recovery-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.732801+00:00 |
| 34870ade-8efa-4abf-af5d-72282dc60c13 | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.699810+00:00 |
| 7d68d459-e9e8-4858-b7d6-256f739efa91 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.665454+00:00 |
| de145564-b704-4e01-8665-b1e4fcd2445a | python-explain-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.637349+00:00 |
| b00e38f6-a44e-4394-98ab-6499b816573b | python-dependency-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.607199+00:00 |
| 38d00448-c432-4415-b74f-9f4e846e93b1 | python-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.565000+00:00 |
| 9dcb56f6-4248-4bda-91b1-58af5de3648a | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:46.503746+00:00 |