KB Arena Leaderboard
Aggregated benchmark scores across every run in this deployment. Higher accuracy + Recall@5 + NDCG@5 are better; lower cost + latency are better. To submit a run, open a PR with your results/run_* JSON.
Loading…
Aggregated benchmark scores across every run in this deployment. Higher accuracy + Recall@5 + NDCG@5 are better; lower cost + latency are better. To submit a run, open a PR with your results/run_* JSON.
Loading…