| Task ID | Band | Score | Passed | Cost |
|---|
| python-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-security-fix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-bugfix-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-performance-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-test-writing-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-multi-file-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-refactor-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| typescript-config-easy-001 | easy | 0.740 | ✓ | $0.0010 |
| python-recovery-easy-001 | easy | 0.740 | ✓ | $0.0010 |
Latest run: d59dc6a4-b907-4a96-b1b8-a7e9a70b9897 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-05-07T23:30:03.963766+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| d59dc6a4-b907-4a96-b1b8-a7e9a70b9897 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:03.963766+00:00 |
| a183bb57-1c80-419e-8695-a48461580bde | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:03.873048+00:00 |
| e811389a-df40-4cd8-885f-4f33d682a6ec | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:03.779626+00:00 |
| c682aaad-b049-4c95-8d5b-00a1d7e39b70 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:03.683545+00:00 |
| fc139d5d-bf96-48af-950f-1dc8991261b3 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-07T23:30:03.580229+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| d59dc6a4-b907-4a96-b1b8-a7e9a70b9897 | python-recovery-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.963766+00:00 |
| a183bb57-1c80-419e-8695-a48461580bde | typescript-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.873048+00:00 |
| e811389a-df40-4cd8-885f-4f33d682a6ec | python-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.779626+00:00 |
| c682aaad-b049-4c95-8d5b-00a1d7e39b70 | typescript-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.683545+00:00 |
| fc139d5d-bf96-48af-950f-1dc8991261b3 | python-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.580229+00:00 |
| 2ef5566b-1db9-4d95-9bd2-7bc8b2a5971f | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.494638+00:00 |
| b9c8f75d-2bd0-443d-aee4-f19cf710289b | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.407249+00:00 |
| 5a345a0b-9b3d-453e-8973-ff09963369e7 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.322140+00:00 |
| 92eec57e-52c7-496e-86c8-561c299344d9 | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.236514+00:00 |
| 422a9d85-564b-4f98-b473-ce01b09c2530 | typescript-performance-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-07T23:30:03.129601+00:00 |