| Task ID | Band | Score | Passed | Cost |
|---|---|---|---|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-dependency-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-explain-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-security-fix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-bugfix-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-performance-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-test-writing-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-multi-file-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-refactor-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-config-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-recovery-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-dependency-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| shell-explain-medium-001 | medium | 0.740 | ✓ | $0.0010 |
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |

Latest run: 8bef2842-567f-4f02-a775-ef3493381090 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-27T16:10:31.540943+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 8bef2842-567f-4f02-a775-ef3493381090 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:10:31.540943+00:00 |
| 9fd9cfb5-0ec7-454a-8aaf-da945ec53406 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:10:31.516954+00:00 |
| 524f3a2a-f395-4d2a-9af0-f25d37176a3a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:10:31.486558+00:00 |
| 44746b02-0eba-46f9-99fd-07c200c37acf | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:10:31.417888+00:00 |
| 9234b708-d34d-4f2b-a20a-5661cfbe49ef | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:10:31.358403+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 8bef2842-567f-4f02-a775-ef3493381090 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.540943+00:00 |
| 9fd9cfb5-0ec7-454a-8aaf-da945ec53406 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.516954+00:00 |
| 524f3a2a-f395-4d2a-9af0-f25d37176a3a | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.486558+00:00 |
| 44746b02-0eba-46f9-99fd-07c200c37acf | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.417888+00:00 |
| 9234b708-d34d-4f2b-a20a-5661cfbe49ef | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.358403+00:00 |
| 7606c9dd-8780-4235-aeb4-bb1c32a6d3b0 | python-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.299956+00:00 |
| f4162e98-e728-4143-9df5-7bc0e83f1a7d | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.230564+00:00 |
| a510f33b-3188-45b9-9200-65af4a710c77 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.189312+00:00 |
| 6820ff7b-a132-4c59-83ea-cf5b0fc8a61d | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.137953+00:00 |
| cf810e5e-c499-4de4-8281-48b5568c4e7b | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:10:31.096905+00:00 |