🔍 Sherlock Live Inspector

Bring any LLM — Gemini, OpenAI, Anthropic, an open-source-model host (DeepInfra · Together · OpenRouter), or a local OpenAI-compatible server — and watch Sherlock curate its context in real time.

Gemini
OpenAI
Anthropic
DeepInfra
Together
OpenRouter
Local

LLM-1 (search + fetch) and LLM-3 (freshness) both use this engine. DuckDuckGo is free but weak for news; Brave/Tavily/Valyu give far better results with a key.

💡 turbo runs LLM-3 inference + LLM-2 compaction every turn (the 🧠 Inference / 🗜 Compaction panels always fill). cold_start keeps it single-model until a real signal needs the companions; off is the legacy gate.