Latest run: 07a39ebc-7c0c-4e06-95d6-b3f0b7c99824 | Latest model: coder | Latest score: 0.740 | Recorded at: 2026-04-28T00:18:53.882650+00:00
| Run ID | Model | Git SHA | Score | Created |
|---|
| 07a39ebc-7c0c-4e06-95d6-b3f0b7c99824 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:18:53.882650+00:00 |
| bbc10a77-d014-4653-9809-8c2139385e2c | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:18:53.805023+00:00 |
| 41f6e544-f9ef-4a96-9f80-341d6717e030 | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:18:53.302265+00:00 |
| e3b0abc1-6d99-418e-bb91-74a862e5d4fb | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:18:53.224427+00:00 |
| ff324ac3-99f0-4e27-b45a-f66c6c6ace5b | coder | 4669773b4fbe9d507f1396f38777a1b36998faf3 | 0.740 | 2026-04-28T00:18:53.142669+00:00 |
| Run ID | Task ID | Taxonomy | Score | Cost | Created |
|---|
| 07a39ebc-7c0c-4e06-95d6-b3f0b7c99824 | typescript-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:53.882650+00:00 |
| bbc10a77-d014-4653-9809-8c2139385e2c | python-security-fix-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:53.805023+00:00 |
| 41f6e544-f9ef-4a96-9f80-341d6717e030 | python-recovery-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:53.302265+00:00 |
| e3b0abc1-6d99-418e-bb91-74a862e5d4fb | typescript-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:53.224427+00:00 |
| ff324ac3-99f0-4e27-b45a-f66c6c6ace5b | python-config-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:53.142669+00:00 |
| 1090529f-5914-41e1-b45c-16ffd270c7bd | typescript-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:53.066943+00:00 |
| 7d27d79d-1b14-493c-8378-2cd4ec90eb17 | python-refactor-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:52.969914+00:00 |
| f4e212b5-20fe-42cd-b7ac-b3f873fbef94 | typescript-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:52.894174+00:00 |
| a8f6806b-f005-4c46-8957-6ee7f272f7f3 | python-multi-file-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:52.806501+00:00 |
| 914cc740-0cea-4aac-b3ed-fadf245485c8 | typescript-test-writing-easy-001 | wrong-logic | 0.740 | $0.0010 | 2026-04-28T00:18:52.740924+00:00 |