Candlekeep RAG Benchmarks

Precision: Percentage of retrieved chunks that belong to relevant documents. (How "clean" are the results?)

Recall: Percentage of expected documents found. (Did we find everything we were looking for?)