Persistent memory for AI agents. Semantic search, knowledge graph, automatic consolidation — now accessible from Claude.ai via Streamable HTTP.
OpenAI-compatible quality scoring: point memory scoring at Ollama, LiteLLM, MLX-LM, or vLLM. Soft-delete UPDATE guards. ~1,758 tests.
Set MCP_QUALITY_AI_PROVIDER=openai-compatible to score memories with any OpenAI /v1/chat/completions-compatible endpoint — Ollama, LiteLLM, MLX-LM server, or vLLM. No cloud API key required. (PR #790)
New Tier 2 in the quality scoring chain: local ONNX → openai-compatible → Groq → Gemini → implicit signals. Endpoint failures fall through silently — no exception reaches the storage path.
Seven UPDATE memories SET ... statements in sqlite_vec.py now include AND deleted_at IS NULL, preventing accidental modification of tombstoned rows. (PR #783, @filhocf)
Claude.ai connects to your memory server through Streamable HTTP with OAuth 2.1 authentication.
Battle-tested with comprehensive testing and optimized for performance.
Install from PyPI and connect your AI agents to persistent memory.