A Python SDK that sits between your AI app and any LLM intercepting every prompt and response to enforce security policies, block data leaks, and detect threats. Everything runs locally.
1. The Cost Risk: CLI coding agents (Claude Code, Cursor, Aider) run autonomously. If they get stuck in an infinite retry loop or parse a massive log file, you wake up to a $50 API bill. You have zero visibility into session costs until the bill arrives.
2. The Security Risk: Coding agents have full access to your workspace. If you
accidentally leave an AWS_SECRET_KEY or .env credential in a file, the
agent will silently upload it to a third-party LLM provider.
Guardian Runtime is a zero-latency FinOps and Security firewall. It runs entirely on your local machine and sits directly between your coding agents and the LLM provider.
Session Analytics & Hard Budgets: Automatically tracks tokens and costs per session via the CLI. It sets a hard $10/day default limit so infinite loops never drain your credit card.
Local Secret Scanning: Instantly intercepts and blocks API keys, AWS credentials,
and .env secrets from ever leaving your local machine. Zero configuration required.
analytics command.
policy.yaml file.
If using the Proxy, developers see the block instantly inside the UI of their tool
(e.g. Claude Code chat) and in the background proxy logs.
If using the SDK, it surfaces in their standard Python server logs.
Zero crashes. In Proxy mode, Guardian cleanly returns a standard HTTP 400/403
error. This ensures CLI agents display an error message gracefully instead of crashing their
process.
In SDK mode, it raises a standard Python Exception.
No obscure stack traces. They see a completely transparent, actionable string telling them exactly
what policy they violated.
Example: 🚨 [BUDGET_EXCEEDED] Daily budget of $10.00
exceeded.