Upload the actual human reviews and the AI review they should be compared to. The validation returns a calibration report — which persona prompts should be strengthened, which should be quieted, and what the human reviewers caught that the AI missed.
{% if not any_provider_configured %}config.yaml
directly — then come back here.
| Run | Human review file | Status | Started | |
|---|---|---|---|---|
{{ v.run_id }} |
{{ v.actual_filename or '—' }} | {{ v.status }} | {{ v.started_at[:19].replace('T', ' ') if v.started_at else '' }} | {% if v.status == 'done' %} View {% else %} Status {% endif %} |
No validations yet.
{% endif %}