| Task ID | Band | Score | Passed | Cost |
|---|
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: 952c388a-99fa-4326-913d-c478e5e11d10 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-27T16:05:40.483822+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 952c388a-99fa-4326-913d-c478e5e11d10 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.483822+00:00 |
| 03f60c91-cde3-4670-907f-26765aba3708 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.457325+00:00 |
| 1696b596-c346-4bc6-befc-4e9a30458cb6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.420944+00:00 |
| a5414f65-5720-435a-a3c5-0e5c4dd31d3c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.379029+00:00 |
| fa97bee0-4552-4ea8-9e31-18c2d8a60767 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.333625+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 952c388a-99fa-4326-913d-c478e5e11d10 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.483822+00:00 |
| 03f60c91-cde3-4670-907f-26765aba3708 | typescript-explain-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.457325+00:00 |
| 1696b596-c346-4bc6-befc-4e9a30458cb6 | typescript-dependency-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.420944+00:00 |
| a5414f65-5720-435a-a3c5-0e5c4dd31d3c | typescript-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.379029+00:00 |
| fa97bee0-4552-4ea8-9e31-18c2d8a60767 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.333625+00:00 |
| ebe39a68-4dd2-485b-86f1-a7b6afc91e9e | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.299347+00:00 |
| 695bc049-4219-413e-b8bc-960d1d302526 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.266624+00:00 |
| 90f013eb-ff40-4143-a5ef-7e8c24e3f3c1 | python-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.217042+00:00 |
| f3aeb7ea-a968-45eb-9eee-846eca386728 | python-recovery-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.191709+00:00 |
| 9f22998a-b8d8-48d4-841a-0d2cdf3430e4 | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.150928+00:00 |