Weave QuickDistill

Quick Start:
  1. Sync traces: Fetch a Weave project to import traces from your LLM calls
  2. Select Evaluation data: Choose a subset of traces to use as input for weak model evaluation and click 'Export selected to test set'
  3. Generate weak outputs: Run inference with smaller models on your test set
  4. Evaluate quality: Use judges to compare weak model responses against strong model outputs to find the best budget model
āœ… Fully supported: OpenAI (chat.completions, responses), Anthropic (Messages), Google Gemini (generate_content, Chat)
Total: 0
Shown: 0
šŸ“‹ MANUAL WORKFLOW
āš™ļø TOOLS
Judges
⚔ AUTOMATIC WORKFLOW
Export → Generate → Evaluate (all in one)
Loading traces...