● AI Infrastructure OS · MIT · 0 telemetry
You are overspending on AI.

$4.82 → $0.00

Same tasks. Different intelligence. Real benchmark on an MSI desktop (GTX 1650, 4 GB VRAM, 8 threads): 5 tasks, 506 tokens, $0.00 cloud spend. Every task ran locally on qwen2.5-coder:7b.

Install in 30 seconds
pip install omniagent-fleet
omniagent web
Opens at http://localhost:8765 — dashboard with 5 tabs.
First time? Run omniagent status to see your hardware. Run omniagent node start to join a fleet.
Real benchmark · Jun 2026 · MSI desktop (GTX 1650)
Task                          Model             Latency    Cost
Review Python function        qwen2.5-coder:7b  30,123 ms  $0.00
Write Google-style docstring  qwen2.5-coder:7b  50,475 ms  $0.00
Rename variable               qwen2.5-coder:7b  10,181 ms  $0.00
Explain TCP vs UDP            qwen2.5-coder:7b  15,321 ms  $0.00
Classify bug ticket           qwen2.5-coder:7b   8,649 ms  $0.00
Fleet: 1 node (msi-node) · Total: $0.00 · 506 tokens · avg 22.9s/task
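The summary line can be checked directly from the per-task latencies. A small Python sketch that recomputes the total and the average, using only the numbers reported in the benchmark:

```python
# Latencies in milliseconds, copied from the benchmark results above.
latencies_ms = {
    "Review Python function": 30_123,
    "Write Google-style docstring": 50_475,
    "Rename variable": 10_181,
    "Explain TCP vs UDP": 15_321,
    "Classify bug ticket": 8_649,
}

total_ms = sum(latencies_ms.values())
avg_s = total_ms / len(latencies_ms) / 1000  # mean latency in seconds

print(f"total: {total_ms} ms, avg: {avg_s:.1f} s/task")
# prints "total: 114749 ms, avg: 22.9 s/task"
```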
🎯

Smart Routing

Tries local model first, falls back to cloud only when budget or quality demands it.

omniagent route "fix the login bug"
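One way to picture local-first routing is the sketch below. It is an illustration only, not omniagent's actual router: the `route` function, the `RouteDecision` type, and the quality and budget parameters are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class RouteDecision:
    target: str  # "local" or "cloud"
    reason: str


def route(local_quality: float, min_quality: float,
          est_cloud_cost: float, remaining_budget: float) -> RouteDecision:
    """Hypothetical local-first policy: prefer local, use cloud only if needed and affordable."""
    # Local model is good enough: stay local and spend nothing.
    if local_quality >= min_quality:
        return RouteDecision("local", "local model meets quality threshold")
    # Local model falls short: escalate only if the budget covers it.
    if est_cloud_cost <= remaining_budget:
        return RouteDecision("cloud", "local quality too low, budget allows cloud")
    # Otherwise stay local rather than overspend.
    return RouteDecision("local", "cloud would exceed remaining budget")
```

Under these assumptions, `route(0.4, 0.7, 0.05, 1.0)` escalates to cloud, while the same task with an estimated cost of 2.0 stays local.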
💸

Cost Tracking

Per agent, per model, per project. Real-time dashboard shows what each task costs.

omniagent status
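Tracking cost per agent, per model, and per project amounts to keeping one running total per dimension. A minimal sketch, assuming a hypothetical `CostLedger` class (not part of omniagent's documented API):

```python
from collections import defaultdict


class CostLedger:
    """Hypothetical ledger that accumulates spend by agent, model, and project."""

    def __init__(self):
        # Keys are (dimension, name) pairs, e.g. ("agent", "developer").
        self._totals = defaultdict(float)

    def record(self, agent, model, project, cost_usd):
        # Add the same cost to each of the three dimensions.
        for key in (("agent", agent), ("model", model), ("project", project)):
            self._totals[key] += cost_usd

    def total(self, dimension, name):
        # .get avoids creating empty entries for names never recorded.
        return self._totals.get((dimension, name), 0.0)
```

A local run records a cost of 0.0, so it still shows up in per-task counts without adding to the totals.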
🛡️

Guardian++ Audit

Scans for secrets, blocks destructive commands, verifies commits. Auto-rollback on failure.

guardian scan .
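To give a feel for what secret scanning and command blocking involve, here is a deliberately small sketch. The patterns and function names are illustrative assumptions, not Guardian++'s actual rule set, which is likely far broader.

```python
import re

# Illustrative patterns only; a real scanner would ship many more.
SECRET_PATTERNS = {
    "aws_access_key_id": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "private_key_header": re.compile(r"-----BEGIN (?:RSA |EC |OPENSSH )?PRIVATE KEY-----"),
}

DESTRUCTIVE_PATTERNS = [
    re.compile(r"\brm\s+-rf\s+/(?:\s|$)"),      # rm -rf on the filesystem root
    re.compile(r"\bgit\s+push\b.*--force\b"),   # forced push
]


def find_secrets(text):
    """Return (line_number, pattern_name) for every suspicious line."""
    hits = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for name, pattern in SECRET_PATTERNS.items():
            if pattern.search(line):
                hits.append((line_no, name))
    return hits


def is_destructive(command):
    """True if the command matches any blocked pattern."""
    return any(p.search(command) for p in DESTRUCTIVE_PATTERNS)
```

Regex matching of this kind produces false positives and misses obfuscated secrets, which is why a scanner would typically be paired with commit verification and rollback.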
🔧

5-Tab Web Dashboard

Run, Optimize, Hardware, Agents, Monitor. Visual routing trace per task. No code needed.

omniagent web
🐝

Any Model, Any Provider

Claude, GPT-4, DeepSeek, Mistral, Ollama (local), vLLM. You control costs.

OLLAMA_BASE_URL=http://192.168.1.50:11434
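Pointing at a remote Ollama host comes down to reading that variable and calling Ollama's HTTP API. The sketch below uses Ollama's `/api/tags` endpoint, which lists installed models; the helper names and the `http://localhost:11434` fallback are assumptions for illustration, not omniagent internals.

```python
import json
import os
from urllib.request import urlopen


def ollama_base_url():
    """Read OLLAMA_BASE_URL, falling back to an assumed local default."""
    return os.environ.get("OLLAMA_BASE_URL", "http://localhost:11434").rstrip("/")


def tags_url():
    """URL of Ollama's model-listing endpoint."""
    return f"{ollama_base_url()}/api/tags"


def list_models(timeout=5.0):
    """Return the names of models installed on the configured Ollama host."""
    with urlopen(tags_url(), timeout=timeout) as resp:
        payload = json.load(resp)
    return [m["name"] for m in payload.get("models", [])]
```

Calling `list_models()` requires a reachable Ollama server; on the benchmark machine it would be expected to include qwen2.5-coder:7b.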
🏠

100% Local-First

Runs on your hardware. No telemetry. No vendor lock-in. Zero marginal cost.

no API keys required

Hardware Detection — what your machine can do

CPU: 8 threads
RAM: 15.7 GB
VRAM: 4.0 GB
GPU: GTX 1650
Tier: medium
Models: 11 local
Run omniagent detect to see your hardware. Run omniagent optimize to find cheaper routes.

Built-in Agents — YAML-defined, composable

developer · local
architect · cloud
reviewer · local
tester · local
guardian · audit
humanizer · local
deployer · cloud
Write your own in ~/.omniagent/agents/my-agent.yaml · Install a shared agent with omniagent agent-install ./agent.yaml

Fleet Commands — orchestrate multiple machines

fleet add <url>
fleet list
fleet status
fleet remove <node>
fleet route "task"
fleet benchmark
Start a node: omniagent node start --port 8766 · The fleet router picks the best node by free VRAM and CPU load.
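Node selection by free VRAM and CPU load could look like the sketch below. The scoring formula (free VRAM scaled by idle CPU fraction), the `Node` type, and the `big-node` entry are hypothetical; the actual fleet router may weigh these signals differently.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    free_vram_gb: float
    cpu_load: float  # 0.0 (idle) to 1.0 (saturated)


def pick_node(nodes, required_vram_gb=0.0):
    """Pick the node with the best assumed score, or None if none can fit the model."""
    # Drop nodes that cannot hold the model at all.
    candidates = [n for n in nodes if n.free_vram_gb >= required_vram_gb]
    if not candidates:
        return None
    # Assumed score: more free VRAM is better, a busy CPU discounts it.
    return max(candidates, key=lambda n: n.free_vram_gb * (1.0 - n.cpu_load))


fleet = [
    Node("msi-node", free_vram_gb=3.0, cpu_load=0.2),
    Node("big-node", free_vram_gb=8.0, cpu_load=0.9),  # hypothetical second machine
]
```

With this scoring, `pick_node(fleet)` prefers the lightly loaded msi-node, while `pick_node(fleet, required_vram_gb=4.0)` falls through to big-node because the smaller GPU cannot fit the model.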
100% private · 0 telemetry · MIT License · Code never leaves your machine unless you say so.