Latest run: f33a4ae3-395d-4158-a72d-6866a8d86f88 | Latest model: coder | Latest score: 0.310 | Recorded at: 2026-05-09T13:27:30.689975+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| f33a4ae3-395d-4158-a72d-6866a8d86f88 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T13:27:30.689975+00:00 |
| a4dce959-75f0-48ee-bb33-e91193fec761 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T13:27:30.515573+00:00 |
| d7bac199-fe48-4a96-855b-a6c172ddf3b0 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-05-09T13:27:30.362633+00:00 |
| 25b36b13-5955-47a4-b2a6-85f475463abf | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T13:27:30.181879+00:00 |
| ac937dd1-8406-47de-8268-ad086277e5b6 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.310 | 2026-05-09T13:27:30.055673+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| f33a4ae3-395d-4158-a72d-6866a8d86f88 | canary-shell-ops-004 | wrong-file | 0.310 | $0.0010 | 2026-05-09T13:27:30.689975+00:00 |
| a4dce959-75f0-48ee-bb33-e91193fec761 | python-bugfix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T13:27:30.515573+00:00 |
| d7bac199-fe48-4a96-855b-a6c172ddf3b0 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T13:27:30.362633+00:00 |
| 25b36b13-5955-47a4-b2a6-85f475463abf | canary-python-cache-005 | wrong-file | 0.310 | $0.0010 | 2026-05-09T13:27:30.181879+00:00 |
| ac937dd1-8406-47de-8268-ad086277e5b6 | canary-typescript-auth-006 | wrong-file | 0.310 | $0.0010 | 2026-05-09T13:27:30.055673+00:00 |
| 7d9f4feb-4f3b-4508-807a-34e08ea205b3 | canary-python-security-001 | wrong-file | 0.310 | $0.0010 | 2026-05-09T13:27:29.952739+00:00 |
| 1c86e201-7d3c-458f-a273-4c7a920ca9b1 | canary-typescript-session-003 | wrong-file | 0.310 | $0.0010 | 2026-05-09T13:27:29.810274+00:00 |
| 02ce764d-b4a2-4c01-842c-e84796907b22 | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-05-09T13:27:29.699906+00:00 |