Latest run: 492fadd4-8806-4ab3-850a-12b46966a945 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T14:38:02.045650+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 492fadd4-8806-4ab3-850a-12b46966a945 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T14:38:02.045650+00:00 |
| 5b3e538c-30d8-4e51-89ed-d27b72a38302 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T14:38:01.922015+00:00 |
| 10029a98-1010-4ab9-b81c-4b4040ef903a | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T14:38:01.807000+00:00 |
| 72f9b3d9-fb75-4cd1-8cd3-5fc7e1019661 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T14:38:01.687581+00:00 |
| bf5a99dc-b743-4040-b22e-56c38c75877d | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T14:38:01.569198+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 492fadd4-8806-4ab3-850a-12b46966a945 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:38:02.045650+00:00 |
| 5b3e538c-30d8-4e51-89ed-d27b72a38302 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T14:38:01.922015+00:00 |
| 10029a98-1010-4ab9-b81c-4b4040ef903a | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T14:38:01.807000+00:00 |
| 72f9b3d9-fb75-4cd1-8cd3-5fc7e1019661 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:38:01.687581+00:00 |
| bf5a99dc-b743-4040-b22e-56c38c75877d | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:38:01.569198+00:00 |
| d4f6c017-fc87-479a-b225-c0b490697661 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:38:01.451347+00:00 |
| ab505faa-4626-49c5-bba3-06e12d83fdad | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T14:38:01.327117+00:00 |
| 21871011-3cad-4362-b995-5171571990b9 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T14:38:01.202306+00:00 |