Autonomous agent: LLM-driven tool-use loop
An LLM-driven tool-use loop over your full Praxia stack: personal memory, org memory, frozen layer, skills, connectors. The agent picks tools on its own (search → run skill → pull connector → answer) with ACL gates and audit logging. Ships as praxia.agent.AutonomousAgent, praxia agent run, and an MCP meta-tool for remote clients.
Prompt Designer: turn intent into a polished template
Describe the task in one line ("score contract risk 1-5 in JSON") and get a production-grade prompt design back: tuned system message, ${variable} user template, 2-3 few-shot examples, 5-criterion evaluation rubric. Per-LLM idioms applied automatically (Claude XML / OpenAI JSON-mode / DeepSeek-R1 reasoning channel / Mistral concise / Llama numbered steps). Save to PromptStore, then A/B-test multiple variants via the experiments framework.
Personal → org memory loop
Senior staff's "magic prompts" auto-promote into shared knowledge via three independent paths: frequency, outcome correlation, and LLM self-eval.
3-path promotion engine
Frequency-based, outcome-correlated, LLM-scored. The three paths run in parallel, so promotion never depends on a single signal. Configurable thresholds for auto-promote vs review.
Workflow-specialized flows
Sales prep, logic checking, RAG self-correction: three production-ready multi-agent pipelines that run in 5 minutes. No bespoke orchestration code required.
6 default business skills
Investment, sales, design, purchasing, patent, legal: domain-tuned agents with built-in guardrails (tax law, jurisdictional caveats, hallucination guards).
Evidence by default
Sentence-level hallucination detection and retrieval metrics ship as first-class modules. "It works" comes with proof attached.
6 LTM backends
JSON, Mem0, LangMem, Letta, Zep, HindSight: switch with one line. Plus an optional Graph layer for relationship-heavy domains. Zero vendor lock-in.
Multi-LLM (100+ providers)
Claude, ChatGPT, Gemini, Qwen-API, Qwen-local (Ollama) + 100+ via LiteLLM. Auto-detect from env vars; switch model per-call.
Auth, RBAC, SSO, audit in OSS
API key + JWT + OIDC (Google/MS/Okta/GitHub/Keycloak) + 4 default roles + append-only audit log. Most competitors paywall this.
Skills also promoted
Not just memory: your personal skills get tracked, scored, and promoted to the org skill catalog when they prove themselves.
MCP / Claude Skills compatible
Skills serialize to standard SKILL.md. Drop into Claude Skills, Cursor Skills, or any MCP-compatible registry without code changes.
Outcome tracking built-in
record_outcome() attaches success/failure to episodes. The consolidator uses these signals statistically, so no separate analytics pipeline is needed.
Apache 2.0 + Open Core ready
Permissive license, commercial-friendly. NOTICE.md inventories every dependency's license. Open Core path for enterprise extras planned.
Resource access policies (ACL)
Glob-pattern allow / deny rules per resource type (connector, memory, prompt, skill). Built for enterprise IS departments. Every decision audit-logged.
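A minimal sketch of how glob-pattern allow / deny rules per resource type could be evaluated. The `Rule` class, the `is_allowed` function, the "deny wins" tie-break, and the "unmatched means denied" default are all assumptions made for illustration, not Praxia's actual API or semantics. Audit logging is left out.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule shape; Praxia's real policy model may differ."""
    resource_type: str  # "connector", "memory", "prompt", or "skill"
    pattern: str        # glob matched against the resource name
    effect: str         # "allow" or "deny"


def is_allowed(rules, resource_type, name):
    """Return True if access is allowed.

    Assumed semantics: any matching deny rule wins over allow rules,
    and a resource that matches no rule at all is denied.
    """
    matched = [
        rule for rule in rules
        if rule.resource_type == resource_type and fnmatchcase(name, rule.pattern)
    ]
    if any(rule.effect == "deny" for rule in matched):
        return False
    return any(rule.effect == "allow" for rule in matched)


rules = [
    Rule("connector", "box/*", "allow"),
    Rule("connector", "box/hr-*", "deny"),
]
```

With these rules, `is_allowed(rules, "connector", "box/sales")` is allowed, `box/hr-payroll` is blocked by the deny rule, and anything outside `box/` is denied because no rule matches it.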
Admin data exports
CSV / JSON / JSONL exports of audit log, users, usage, memory, policies, for compliance, SIEM, and backups. Each export action self-audited.
6 storage / SaaS connectors
Box, SharePoint, Dropbox, Google Drive, kintone, Salesforce: bidirectional Pull + Push. ACL-enforced before any external call.
Personal & org dashboards
Flow / skill counts, success rate, top users, promoted blocks, frozen files, distributed skills, all out of the box, with no separate analytics pipeline.
Custom prompt distribution
Users save personal prompts. Admins promote them to org or push to specific roles / users. Three scopes with merge precedence.
Full admin user CRUD
Create / update / delete / deactivate / rotate keys / change roles, all via CLI, UI, or SDK. All operations audited.
File parsers (PDF · Word · Excel · PowerPoint · CSV · HTML)
Drop a file in and it auto-dispatches by extension. PDF page-by-page, Word with heading detection, Excel as Markdown tables, PowerPoint with speaker notes. Custom formats register via entry points.
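Dispatch by extension can be pictured as a small registry keyed on the file suffix. The sketch below is illustrative only: `PARSERS`, `register`, `pick_parser`, and the two toy parsers are hypothetical names, and an in-process dict stands in for the entry-point mechanism mentioned above.

```python
import csv
from pathlib import Path

PARSERS = {}  # hypothetical registry: lowercase extension -> parser function


def register(*extensions):
    """Decorator that registers a parser for one or more file extensions."""
    def decorator(func):
        for ext in extensions:
            PARSERS[ext.lower()] = func
        return func
    return decorator


@register(".csv")
def parse_csv(path):
    """Return the CSV file as a list of rows (each row a list of strings)."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.reader(handle))


@register(".txt", ".md")
def parse_text(path):
    """Return the file contents as a single string."""
    return Path(path).read_text(encoding="utf-8")


def pick_parser(filename):
    """Look up the parser for a file name; the match is case-insensitive."""
    ext = Path(filename).suffix.lower()
    try:
        return PARSERS[ext]
    except KeyError:
        raise ValueError(f"no parser registered for {ext!r}") from None
```

A caller would then run `pick_parser(path)(path)`. An unknown extension raises `ValueError` here rather than guessing a format.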
Voice input + voice output
Speech-to-text (Whisper) and text-to-speech (OpenAI TTS / ElevenLabs / Piper). Embedded in Streamlit UI as record-and-go input and read-aloud output.
User-delegated OAuth
Each Praxia user authorizes Box / SharePoint / Dropbox / Drive / Salesforce with their own credentials. The external system's native ACL is enforced per Praxia user: alice can only see what alice has access to.
Legal templates
Terms of Service, Privacy Policy, Acceptable Use, Cookie Policy: starter templates wired into the portal sign-up. Marked clearly as templates requiring legal review before commercial use.
Multi-LTM fusion + routing
Run several LTMs in parallel and fuse with Reciprocal Rank Fusion, or route per query (temporal → Zep, audit → JSON, entity → Mem0). English + Japanese keyword detection. Higher recall without picking a winner.
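For readers new to Reciprocal Rank Fusion: each backend returns its own ranked list, and an item's fused score is the sum of 1 / (k + rank) over every list it appears in. The sketch below shows the standard formula. The function name, the sample memory ids, and the choice of k = 60 (the constant commonly used in the RRF literature) are assumptions, not a description of Praxia's internals.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of ids into one list, best first.

    rankings: iterable of lists, each ordered best-first by one backend.
    k: smoothing constant; larger values flatten the gap between ranks.
    """
    scores = defaultdict(float)
    for ranked_ids in rankings:
        for rank, item_id in enumerate(ranked_ids, start=1):
            scores[item_id] += 1.0 / (k + rank)
    # Sort by descending score; break ties by id so the output is stable.
    return sorted(scores, key=lambda item_id: (-scores[item_id], item_id))


# Hypothetical results from three backends for the same query.
zep_hits = ["m3", "m1", "m2"]
mem0_hits = ["m1", "m4"]
json_hits = ["m1", "m3"]

fused = reciprocal_rank_fusion([zep_hits, mem0_hits, json_hits])
```

Here `m1` comes out first because all three backends return it near the top, followed by `m3`, which two backends agree on. RRF only needs ranks, so backends with incomparable score scales can be fused without normalization.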
Memory mode toggle
Per-user switch: accumulate (default) or read_only. Read-only sessions silently drop writes, which is useful for sensitive content. Admins can lock the mode tenant-wide or by role.
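The two modes can be sketched as a guard on the write path. `SessionMemory` and its methods are hypothetical names for illustration; only the mode names `accumulate` and `read_only` come from the description above.

```python
class SessionMemory:
    """Toy in-memory store that honors a per-session memory mode."""

    MODES = ("accumulate", "read_only")

    def __init__(self, mode="accumulate"):
        if mode not in self.MODES:
            raise ValueError(f"unknown memory mode: {mode!r}")
        self.mode = mode
        self._items = []

    def write(self, item):
        """Store an item; in read_only mode the write is dropped silently.

        Returns True if the item was stored, False if it was dropped.
        """
        if self.mode == "read_only":
            return False
        self._items.append(item)
        return True

    def read(self):
        """Return a copy of everything stored so far."""
        return list(self._items)
```

Returning a boolean instead of raising keeps callers unaware of the mode, which matches the "silently drop" behavior while still letting tests observe whether a write landed.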
Admin-controlled LTM policy
Pin which backend(s) users may pick and what the default mode is, at the tenant level. Resolution: admin enforced > call-site > user pref > admin default.
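The resolution order reads as a first-non-empty lookup across four sources. A minimal sketch, assuming each source is either a value or `None` when unset; `resolve_setting` and its parameter names are illustrative, not Praxia's API.

```python
def resolve_setting(admin_enforced=None, call_site=None,
                    user_pref=None, admin_default=None):
    """Return the first value that is set, in precedence order:
    admin enforced > call-site > user pref > admin default.
    """
    for value in (admin_enforced, call_site, user_pref, admin_default):
        if value is not None:
            return value
    return None


# The user prefers Mem0 and the call asks for Zep; nothing is enforced.
backend = resolve_setting(call_site="zep", user_pref="mem0", admin_default="json")

# Once an admin enforces a backend, every lower-priority source is ignored.
locked = resolve_setting(admin_enforced="json", call_site="zep", user_pref="mem0")
```

In this sketch `backend` resolves to `"zep"` and `locked` to `"json"`. The same function works for the memory mode, since the precedence rule does not depend on what is being resolved.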
Output exporters (HTML · PPTX · DOCX · MD · JSON)
Skills produce Markdown by default. OutputFormatSkill infers requested format from natural-language hints ("パワポで" → PPTX, "as a Word doc" → DOCX). Custom formats register via entry-point.
Gemma support
Google's open-weight family added. gemma / gemma-2b / gemma-9b / gemma-27b via local Ollama; gemma-cloud via Google Vertex AI.
Backend-only or full-stack
Use Praxia as a brain behind your own frontend (SDK embed or praxia serve FastAPI HTTP API), or run the bundled Streamlit UI for the fastest path. Same auth, memory, skills.
KMS-backed token encryption
OAuth tokens use envelope encryption: fresh DEK per write, AES-GCM payload, DEK wrapped by your KMS. 5 adapters: local / aws / azure / gcp / vault. Master key never lives on the application host.
Production OAuth callback (HTTP)
praxia serve exposes /api/v1/oauth/{provider}/{start,callback,status}. Multi-worker safe state cache (TTL-pruned JSON), pinned redirect URI via PRAXIA_PUBLIC_URL, optional success-redirect to your frontend.
A/B experiments built in
Test prompt variants on real users with deterministic per-user assignment (SHA-256 bucket). Audience filter (roles / users / window). Outcome rollup + tentative winner detection. CLI + SDK.
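Deterministic per-user assignment usually means hashing a stable key and taking the result modulo the number of variants. The sketch below shows the idea. The key format (`experiment_id:user_id`), the function name, and the use of the first 8 digest bytes are assumptions, not the scheme Praxia necessarily uses.

```python
import hashlib


def assign_variant(experiment_id, user_id, variants):
    """Map a user to one variant, identically on every call and every host.

    Including the experiment id in the hashed key means the same user can
    land in different buckets across different experiments.
    """
    if not variants:
        raise ValueError("variants must not be empty")
    key = f"{experiment_id}:{user_id}".encode("utf-8")
    digest = hashlib.sha256(key).digest()
    bucket = int.from_bytes(digest[:8], "big") % len(variants)
    return variants[bucket]


variants = ["prompt_a", "prompt_b"]
first = assign_variant("contract-risk-v2", "alice", variants)
again = assign_variant("contract-risk-v2", "alice", variants)
```

`first` and `again` are always equal, so no assignment table has to be stored or shared between workers. Python's built-in `hash()` would not work here, because string hashing is randomized per process.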
LLM output quality eval (CI gate)
Catch quality regressions before merge. tests/llm_eval/ grades real LLM output against rubrics + a committed baseline. Score drop > 5pt fails the build. Per-skill cases ship for all 6 skills.
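The gate itself reduces to comparing each case's score with the committed baseline. A sketch under stated assumptions: scores are on a 0-100 scale, the baseline is a plain mapping of case name to score, and `find_regressions` is a hypothetical helper rather than the code in tests/llm_eval/.

```python
def find_regressions(baseline, current, max_drop=5.0):
    """Return one message per case whose score fell by more than max_drop.

    baseline, current: dicts mapping case name -> score (assumed 0-100).
    A case present in the baseline but absent from the current run is
    also reported, so a deleted case cannot hide a regression.
    """
    failures = []
    for case, base_score in sorted(baseline.items()):
        score = current.get(case)
        if score is None:
            failures.append(f"{case}: missing from current run")
        elif base_score - score > max_drop:
            failures.append(f"{case}: {base_score:.1f} -> {score:.1f}")
    return failures


baseline = {"legal": 82.0, "sales": 90.0}
current = {"legal": 76.0, "sales": 86.0}
failures = find_regressions(baseline, current)
```

In this example only `legal` fails (a 6-point drop), while `sales` passes with a 4-point drop. A CI job would exit non-zero whenever `failures` is non-empty.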
Hermetic test harness: stubs & drivers for every surface
Every public surface (auth / memory / fusion / exporters / OAuth / parsers / CLI / extensions / experiments / connectors / agent) ships with backend stubs, fixture factories, and protocol-conforming drivers, so contributors can write hermetic tests without standing up real services. CI runs them on every PR.
20 storage / SaaS connectors
Box / SharePoint / Dropbox / Drive / kintone / Salesforce + Notion / Confluence / Jira / Slack / Teams / GitHub / HubSpot / Zendesk / Linear / S3 / Azure Blob / GCS / WebDAV / Email. Per-user OAuth means alice only sees what alice can in each system.
MCP server (stdio + remote HTTP/SSE)
Use Praxia from Claude Desktop / Cursor / Continue.dev. Local: praxia mcp serve. Remote (multi-host): praxia serve exposes /api/v1/mcp with auth + audit log. Every skill + flow becomes an MCP tool automatically.
Mobile-responsive UI + landing
Landing has chip-style nav on phones, scrollable tabs, ≥44px touch targets, prefers-reduced-motion respected. Streamlit UI injects responsive CSS + a "Compact mode" toggle for slow connections.