RAKI Evaluation Report

Run: eval-b6cd29be
Timestamp: 2026-04-19 12:51:23 UTC
Sessions: 6

Aggregate Scores

Operational Health
Verify rate: 75% (sessions passing verify on first try)
Rework cycles: 0.2 (average review-fix iterations per session)
Severity score: 0.39 (weighted severity of review findings; 1.0 = no findings)
Cost / session: $10.93 (average LLM cost per session, USD)
Knowledge miss rate: 1.00 (fraction of rework caused by missing retrieval context)
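
The aggregate definitions above can be sketched in Python as follows. This is a minimal illustration, not RAKI's implementation: the Session record, its field names, and the aggregate function are hypothetical, and the sample values are invented for the example rather than taken from this run. The severity score is left out because the report does not state its weighting.

```python
from dataclasses import dataclass


@dataclass
class Session:
    # Hypothetical per-session record; field names are assumptions, not RAKI's schema.
    verify_passed_first_try: bool
    rework_cycles: int          # review-fix iterations in this session
    knowledge_miss_cycles: int  # rework cycles attributed to missing retrieval context
    cost_usd: float


def aggregate(sessions):
    """Compute the operational-health aggregates as plain averages and ratios."""
    n = len(sessions)
    total_rework = sum(s.rework_cycles for s in sessions)
    total_miss = sum(s.knowledge_miss_cycles for s in sessions)
    return {
        "verify_rate": sum(s.verify_passed_first_try for s in sessions) / n,
        "rework_cycles": total_rework / n,
        "cost_per_session": sum(s.cost_usd for s in sessions) / n,
        # Undefined when no rework occurred, so report None instead of dividing by zero.
        "knowledge_miss_rate": total_miss / total_rework if total_rework else None,
    }


# Illustrative inputs only; these are not the sessions from this report.
sessions = [
    Session(True, 0, 0, 8.0),
    Session(True, 0, 0, 12.0),
    Session(False, 1, 1, 10.0),
    Session(True, 0, 0, 10.0),
]
summary = aggregate(sessions)
print(summary)
```

Under these assumptions the knowledge miss rate is a ratio over rework cycles, not over sessions, so when rework is rare a single cycle can move it all the way to 1.0, as it does in the sample data.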

Retrieval Quality

Retrieval metrics require an LLM judge; run without --no-llm to enable them.

Recurring Failures

No recurring failures detected across sessions.

Worst Sessions

No session ranking available (requires retrieval metrics).

Per-Session Drill-Down