class documentation
RAG (Retrieval-Augmented Generation) provider using Qdrant and Ollama
| Method | chat |
Send messages using RAG with vector search and LLM (normalized to OpenAI format) |
| Method | _get |
Get cached embedding model (singleton pattern to avoid repeated HuggingFace API calls) |
Inherited from LLMProvider:
| Method | __init__ |
Undocumented |
| Instance Variable | context |
Undocumented |
| Method | _prepare |
Convert string message to proper messages array |
overrides
zms.llmapi.LLMProvider.chatSend messages using RAG with vector search and LLM (normalized to OpenAI format)