Latest run: 402e5321-f144-48ef-a869-b93b07a99a6f | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-27T16:05:40.790515+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 402e5321-f144-48ef-a869-b93b07a99a6f | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.790515+00:00 |
| 68fa95ec-85dc-4190-a031-f8e37c54547a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.718191+00:00 |
| 952c388a-99fa-4326-913d-c478e5e11d10 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.483822+00:00 |
| 03f60c91-cde3-4670-907f-26765aba3708 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.457325+00:00 |
| 1696b596-c346-4bc6-befc-4e9a30458cb6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-27T16:05:40.420944+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 402e5321-f144-48ef-a869-b93b07a99a6f | python-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.790515+00:00 |
| 68fa95ec-85dc-4190-a031-f8e37c54547a | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.718191+00:00 |
| 952c388a-99fa-4326-913d-c478e5e11d10 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.483822+00:00 |
| 03f60c91-cde3-4670-907f-26765aba3708 | typescript-explain-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.457325+00:00 |
| 1696b596-c346-4bc6-befc-4e9a30458cb6 | typescript-dependency-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.420944+00:00 |
| a5414f65-5720-435a-a3c5-0e5c4dd31d3c | typescript-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.379029+00:00 |
| fa97bee0-4552-4ea8-9e31-18c2d8a60767 | typescript-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.333625+00:00 |
| ebe39a68-4dd2-485b-86f1-a7b6afc91e9e | python-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.299347+00:00 |
| 695bc049-4219-413e-b8bc-960d1d302526 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.266624+00:00 |
| 90f013eb-ff40-4143-a5ef-7e8c24e3f3c1 | python-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-27T16:05:40.217042+00:00 |