Configure your evaluation backend
We need your API key to access your datasets and evaluators.
Choose the dataset to evaluate your target function against.
Sample rows from your dataset.
Select the files Weco will optimize, then choose the target function.
agent:run_chain imports run_chain from agent.pyHow does your target function accept inputs?
Pass the full inputs dict directly.
def target(inputs: dict) -> dict
Your target is a LangChain Runnable. We call .invoke(inputs).
Extracts a single string from common keys (input, question, text, query).
Evaluators score your target function's outputs on each dataset example.
module:function for custom evaluators, or built-in names like exact_match.
Leave empty to default to your metric name ().
Configure optimization parameters. All fields have sensible defaults.
Confirm your configuration and begin optimization.
Next time, skip the wizard with: