Every reply shows where it routed; open the ? for the score, the features behind it, and the cost saved.
Wayfinder — quick guide
Wayfinder is a structural router: it scores the shape of your prompt (length, lists, code, headings, tables), then sends easy turns to a cheap local model and hard ones to a capable cloud model. The decision is deterministic and needs no model call.
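To make the idea concrete, here is a minimal sketch of structural scoring. It is an illustration only, not Wayfinder's actual code: the feature detectors, the weights, the 2000-character length cap, and the 0.5 threshold are all assumptions chosen for the example.

```python
import re

# Hypothetical weights that sum to 1.0, so the score stays in the 0-1 range.
WEIGHTS = {"length": 0.3, "lists": 0.15, "code": 0.25, "headings": 0.15, "tables": 0.15}

FENCE = "`" * 3  # a code-fence marker, built indirectly to keep this sketch readable


def features(prompt: str) -> dict[str, float]:
    """Extract shape-only features; no model is called."""
    lines = prompt.splitlines()
    return {
        # Length saturates at an assumed 2000 characters.
        "length": min(len(prompt) / 2000, 1.0),
        "lists": 1.0 if any(re.match(r"\s*([-*]|\d+\.)\s+", ln) for ln in lines) else 0.0,
        "code": 1.0 if FENCE in prompt else 0.0,
        "headings": 1.0 if any(re.match(r"#{1,6}\s+\S", ln) for ln in lines) else 0.0,
        "tables": 1.0 if any(ln.count("|") >= 2 for ln in lines) else 0.0,
    }


def score(prompt: str) -> float:
    """Weighted sum of the structural features."""
    f = features(prompt)
    return sum(WEIGHTS[name] * f[name] for name in WEIGHTS)


def route(prompt: str, threshold: float = 0.5) -> str:
    """Same prompt and threshold always give the same route."""
    return "cloud" if score(prompt) >= threshold else "local"
```

Under these assumed weights, a one-line question stays local, while a prompt with a heading, a list, a table, and a code fence crosses the threshold and goes to cloud.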
Reading a reply
Pill — where it routed (local or cloud).
score — structural complexity (0–1) against the threshold.
? — the features behind the score and the cost saved vs always-cloud.
latched — the conversation latch kept a hard chat on the big model.
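The "cost saved vs always-cloud" figure can be read as simple arithmetic: what the turn would have cost on the cloud model, minus what it actually cost. The sketch below assumes per-1,000-token pricing and a free local model; the function name, parameters, and prices are hypothetical, not values taken from Wayfinder.

```python
def cost_saved(tokens: int, routed_to: str,
               cloud_price_per_1k: float,
               local_price_per_1k: float = 0.0) -> float:
    """Savings for one turn compared with sending it to cloud.

    All prices are illustrative assumptions.
    """
    always_cloud = tokens / 1000 * cloud_price_per_1k
    price = cloud_price_per_1k if routed_to == "cloud" else local_price_per_1k
    actual = tokens / 1000 * price
    return always_cloud - actual
```

A turn routed to cloud saves nothing by this definition; only turns kept local contribute to the total.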
Settings
Threshold — move the local↔cloud cut for this chat.
Routing scope — what gets scored in a multi-turn chat.
Sticky / Cool-down — keep a hard chat on cloud, then let it drift back.
Advanced — tune feature weights, enable lexical signals, load a lexicon profile, and export it as config.
Models — which endpoints are wired and whether each API key is set.
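One way to picture Sticky / Cool-down is as a small state machine. The sketch below assumes that a hard turn latches the chat onto cloud and that the latch releases once more than `cool_down` consecutive easy turns have passed; the class, its field names, and that exact release rule are assumptions for illustration, not Wayfinder's documented behavior.

```python
from dataclasses import dataclass


@dataclass
class Latch:
    threshold: float = 0.5   # assumed local/cloud cut
    cool_down: int = 2       # easy turns that stay on cloud before release (assumed)
    latched: bool = False
    easy_streak: int = 0

    def route(self, score: float) -> str:
        # A hard turn always goes to cloud and (re)sets the latch.
        if score >= self.threshold:
            self.latched = True
            self.easy_streak = 0
            return "cloud"
        # An easy turn in a latched chat stays on cloud during the cool-down.
        if self.latched:
            self.easy_streak += 1
            if self.easy_streak > self.cool_down:
                self.latched = False
                self.easy_streak = 0
                return "local"
            return "cloud"
        return "local"
```

With `cool_down=2`, the two easy turns after a hard one still go to cloud, and the third drifts back to local.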
Chats & folders
Conversations live in your browser — nothing is stored server-side. Start chats, drag them into folders (or use the ⋯ menu), pin, and rename from the sidebar.
API keys
Keys are never entered here: set the environment variable named in Settings → Models, then restart the gateway. Run the gateway with --dry-run for a keyless demo.
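If you want to confirm a key is visible before restarting, a check along these lines may help. It is a sketch: the variable name used below is a placeholder, so substitute the name shown in Settings → Models.

```python
import os


def key_status(env_names: list[str]) -> dict[str, bool]:
    """Report whether each named environment variable is set and non-empty."""
    return {name: bool(os.environ.get(name)) for name in env_names}


if __name__ == "__main__":
    # EXAMPLE_CLOUD_API_KEY is a hypothetical name, not one Wayfinder defines.
    for name, is_set in key_status(["EXAMPLE_CLOUD_API_KEY"]).items():
        print(f"{name}: {'set' if is_set else 'missing'}")
```

The check reports only whether a variable is set; it never prints the key itself.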