[How to run pairwise evaluations | 🦜️🛠️ LangSmith](https://docs.smith.langchain.com/evaluation/how_to_guides/evaluate_pairwise): LLM should read this page when comparing outputs from multiple experiments against each other, setting up pairwise evaluations, or implementing LLM-as-judge comparisons. This page explains how to run pairwise evaluations in LangSmith, covering the evaluate() function arguments, defining custom pairwise evaluators, handling evaluator inputs/outputs, running evaluations, and viewing results in the LangSmith UI.

