Weave QuickDistill

Quick Start:
  1. Sync traces: Fetch a Weave project to import traces from your LLM calls
  2. Select evaluation data: Choose a subset of traces to use as input for weak model evaluation, then click 'Export selected to test set'
  3. Generate weak outputs: Run inference with smaller models on your test set
  4. Evaluate quality: Use judges to compare weak model responses against strong model outputs to find the best budget model
Primarily supported operation: openai.chat.completions.create
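
Steps 2 to 4 above can be sketched in plain Python as follows. This is only an illustration of the idea, not QuickDistill's actual implementation: the trace record fields (`op_name`, `input`, `output`), the helper functions, the stub "weak models", and the exact-match judge are all hypothetical stand-ins. A real setup would call actual models and would likely use an LLM-based judge.

```python
# Hypothetical trace records; the real Weave trace schema may differ.
traces = [
    {"op_name": "openai.chat.completions.create", "input": "What is 2 + 2?", "output": "4"},
    {"op_name": "openai.chat.completions.create", "input": "Capital of France?", "output": "Paris"},
    {"op_name": "custom.tool.call", "input": "ignored", "output": "ignored"},
]

SUPPORTED_OP = "openai.chat.completions.create"


def select_test_set(traces, op_name=SUPPORTED_OP):
    """Step 2: keep only traces from the supported call as test inputs."""
    return [t for t in traces if t["op_name"] == op_name]


def generate_weak_outputs(test_set, weak_models):
    """Step 3: run each candidate weak model on every test input."""
    return {
        name: [model(t["input"]) for t in test_set]
        for name, model in weak_models.items()
    }


def exact_match_judge(strong, weak):
    """Stand-in judge: 1.0 if the outputs match (case-insensitive), else 0.0."""
    return 1.0 if strong.strip().lower() == weak.strip().lower() else 0.0


def evaluate(test_set, weak_outputs, judge):
    """Step 4: average judge score per weak model against the strong outputs."""
    scores = {}
    for name, outputs in weak_outputs.items():
        per_item = [judge(t["output"], w) for t, w in zip(test_set, outputs)]
        scores[name] = sum(per_item) / len(per_item) if per_item else 0.0
    return scores


# Stub weak models (assumptions): lookup tables standing in for real inference.
weak_models = {
    "small-a": lambda p: {"What is 2 + 2?": "4", "Capital of France?": "paris"}.get(p, ""),
    "small-b": lambda p: {"What is 2 + 2?": "5"}.get(p, ""),
}

test_set = select_test_set(traces)
weak_outputs = generate_weak_outputs(test_set, weak_models)
scores = evaluate(test_set, weak_outputs, exact_match_judge)
best_model = max(scores, key=scores.get)  # highest average judge score
```

Keeping the judge as a plain function of (strong output, weak output) makes it easy to swap in a different scoring rule without touching the rest of the pipeline.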
Manual Workflow (Step-by-Step):
Manage Judges
Automatic Workflow:
Export → Generate → Evaluate (all in one)