Execution Log
Harness runs with honest-rubric self_score, eval_score, divergence
Loading execution_log…
Recent runs
0 shown
| ID | Actor | Model | Self | Eval | Cost | Flagged |
|---|---|---|---|---|---|---|
| No execution log rows yet | ||||||
* marked costs are synthetic equivalents (actual token price not recorded at runtime).