Latest run: 643b7e7c-8ef8-4c27-b30c-696079dcdc7e | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T03:02:06.485319+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 643b7e7c-8ef8-4c27-b30c-696079dcdc7e | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T03:02:06.485319+00:00 |
| 675e0eca-6e38-4acf-93c5-1aff3fcdde6c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T03:02:06.394881+00:00 |
| 6777c77d-f408-4d5c-b47c-72ced5377a0c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T03:02:06.286787+00:00 |
| 66b1eb7a-98f9-48dc-b888-33574988194b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T03:02:06.213197+00:00 |
| d8080741-e705-41f0-8b33-575e544a3038 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T03:02:06.140626+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 643b7e7c-8ef8-4c27-b30c-696079dcdc7e | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:02:06.485319+00:00 |
| 675e0eca-6e38-4acf-93c5-1aff3fcdde6c | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T03:02:06.394881+00:00 |
| 6777c77d-f408-4d5c-b47c-72ced5377a0c | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T03:02:06.286787+00:00 |
| 66b1eb7a-98f9-48dc-b888-33574988194b | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:02:06.213197+00:00 |
| d8080741-e705-41f0-8b33-575e544a3038 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:02:06.140626+00:00 |
| c0188816-fcbe-48b7-b603-d4c3ecde83bf | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:02:06.052306+00:00 |
| ecd559c7-f366-45a2-b9e8-08e0b840aa10 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T03:02:05.933685+00:00 |
| 647814c3-3c5a-463f-8f75-b5d0bcfcc6f8 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T03:02:05.777423+00:00 |