Latest run: 81c3abc5-2c1c-4444-82da-eeb9cec592db | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-26T10:34:34.437769+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| 81c3abc5-2c1c-4444-82da-eeb9cec592db | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.310 | 2026-05-26T10:34:34.437769+00:00 |
| cbde3298-00d5-40e5-9978-79f7a737301a | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.740 | 2026-05-26T10:34:34.349793+00:00 |
| 4c6e81ec-23a2-4339-80c2-99a85642619d | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.310 | 2026-05-26T10:34:34.264734+00:00 |
| 8b9c0291-45dc-4ecd-893c-02352a242eea | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.740 | 2026-05-26T10:34:34.164910+00:00 |
| 37b7d369-9274-4099-997a-0bd54cd66aa4 | coder | 59417e3b6834192b1ea96a6a9010dee3105efd78 | 0.310 | 2026-05-26T10:34:34.066321+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| 81c3abc5-2c1c-4444-82da-eeb9cec592db | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:34:34.437769+00:00 |
| cbde3298-00d5-40e5-9978-79f7a737301a | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-26T10:34:34.349793+00:00 |
| 4c6e81ec-23a2-4339-80c2-99a85642619d | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:34:34.264734+00:00 |
| 8b9c0291-45dc-4ecd-893c-02352a242eea | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-26T10:34:34.164910+00:00 |
| 37b7d369-9274-4099-997a-0bd54cd66aa4 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:34:34.066321+00:00 |
| 6e78c3e9-febc-44e3-957d-72a81e43b58b | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:34:33.967247+00:00 |
| b457faff-bc19-4e1e-913a-8090a7301970 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-26T10:34:33.882724+00:00 |
| ed45ffc5-783b-4b4e-8be0-d34b80e96f2b | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-26T10:34:33.787052+00:00 |