Latest run: c54c26df-6b89-41ee-aaf2-289f7393feec | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-23T20:14:11.077810+00:00

| Run ID | Model | Git SHA | Score | Created |
|---|---|---|---|---|
| c54c26df-6b89-41ee-aaf2-289f7393feec | coder | 89f0f5456c5b8670ca70d1a941ab0d7272df1310 | 0.310 | 2026-05-23T20:14:11.077810+00:00 |
| 28bf7eec-29e8-41d7-8b25-a657b2d70d30 | coder | 89f0f5456c5b8670ca70d1a941ab0d7272df1310 | 0.740 | 2026-05-23T20:14:11.014620+00:00 |
| 2ea1bf35-b3f2-4aaf-87df-c4613449a7d8 | coder | 89f0f5456c5b8670ca70d1a941ab0d7272df1310 | 0.310 | 2026-05-23T20:14:10.949929+00:00 |
| 9f31c71e-f58e-40e8-bf65-34ba0c3f0ea1 | coder | 89f0f5456c5b8670ca70d1a941ab0d7272df1310 | 0.740 | 2026-05-23T20:14:10.872611+00:00 |
| 2f55af0a-9138-4aa9-aab6-eeb55d45e8f2 | coder | 89f0f5456c5b8670ca70d1a941ab0d7272df1310 | 0.310 | 2026-05-23T20:14:10.809038+00:00 |

| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|---|---|---|---|---|
| c54c26df-6b89-41ee-aaf2-289f7393feec | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:14:11.077810+00:00 |
| 28bf7eec-29e8-41d7-8b25-a657b2d70d30 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:14:11.014620+00:00 |
| 2ea1bf35-b3f2-4aaf-87df-c4613449a7d8 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:14:10.949929+00:00 |
| 9f31c71e-f58e-40e8-bf65-34ba0c3f0ea1 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:14:10.872611+00:00 |
| 2f55af0a-9138-4aa9-aab6-eeb55d45e8f2 | canary-python-regression-002 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:14:10.809038+00:00 |
| 60bb7a65-1166-41ba-b3b0-c990123fc6d5 | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:14:10.763391+00:00 |
| 4d789bc7-3381-4fb2-a86d-604bac9203a6 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-23T20:14:10.697695+00:00 |
| e45b2720-7c38-4a41-bf2a-b10454efb888 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-23T20:14:10.635174+00:00 |