Latest run: 0ce1e28e-9795-444b-8569-4efacd7921a8 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T01:16:01.891363+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 0ce1e28e-9795-444b-8569-4efacd7921a8 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T01:16:01.891363+00:00 |
| 812f866f-053a-4cb3-8bd7-1771deaedafc | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:16:01.747268+00:00 |
| fde9c1c4-afcc-442a-b240-01c9b3745ad8 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T01:16:01.631688+00:00 |
| 99f82c80-c85f-4f68-9513-07dc0af45520 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T01:16:01.498025+00:00 |
| 50d542c0-ba2d-4706-b457-39fca8f8493e | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T01:16:01.344756+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 0ce1e28e-9795-444b-8569-4efacd7921a8 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T01:16:01.891363+00:00 |
| 812f866f-053a-4cb3-8bd7-1771deaedafc | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:16:01.747268+00:00 |
| fde9c1c4-afcc-442a-b240-01c9b3745ad8 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:16:01.631688+00:00 |
| 99f82c80-c85f-4f68-9513-07dc0af45520 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T01:16:01.498025+00:00 |
| 50d542c0-ba2d-4706-b457-39fca8f8493e | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T01:16:01.344756+00:00 |
| 32527fe0-6c07-4dc7-b20b-fc9422b69345 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T01:16:01.207223+00:00 |
| 31f2e0fb-d015-41d4-9422-cff3bb0ffbd8 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T01:16:01.074416+00:00 |
| 02f1a03d-6840-46c4-a3f0-278511c47d9e | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T01:16:00.953857+00:00 |