Metadata-Version: 2.4
Name: ragbits-document-search
Version: 1.7.0.dev202604240307
Summary: Document Search module for Ragbits
Project-URL: Homepage, https://github.com/deepsense-ai/ragbits
Project-URL: Bug Reports, https://github.com/deepsense-ai/ragbits/issues
Project-URL: Documentation, https://ragbits.deepsense.ai/
Project-URL: Source, https://github.com/deepsense-ai/ragbits
Author-email: "deepsense.ai" <ragbits@deepsense.ai>
License-Expression: MIT
Keywords: Document Search,GenAI,Generative AI,LLMs,Large Language Models,RAG,Retrieval Augmented Generation
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.10
Requires-Dist: docling[easyocr]<2.66.0,>=2.65.0
Requires-Dist: filetype<2.0.0,>=1.2.0
Requires-Dist: opencv-python<5.0.0.0,>=4.11.0.86
Requires-Dist: python-pptx<2.0.0,>=1.0.0
Requires-Dist: ragbits-core==1.7.0.dev202604240307
Requires-Dist: rerankers<1.0.0,>=0.6.1
Provides-Extra: ray
Requires-Dist: pyarrow<19.0.0,>=18.0.0; extra == 'ray'
Requires-Dist: ray[data]~=2.52.1; extra == 'ray'
Provides-Extra: unstructured
Requires-Dist: unstructured-client<1.0.0,>=0.26.0; extra == 'unstructured'
Requires-Dist: unstructured<1.0.0,>=0.18.18; extra == 'unstructured'
Description-Content-Type: text/markdown

# Ragbits Document Search

Ragbits Document Search is a Python package that provides tools for building RAG applications. It helps ingest, index, and search documents to retrieve relevant information for your prompts.

## Installation

You can install the latest version of Ragbits Document Search using pip:

```bash
pip install ragbits-document-search
```

## Quickstart
```python
import asyncio

from ragbits.core.embeddings import LiteLLMEmbedder
from ragbits.core.vector_stores.in_memory import InMemoryVectorStore
from ragbits.document_search import DocumentSearch

async def main() -> None:
    """
    Run the example.
    """
    embedder = LiteLLMEmbedder(
        model_name="text-embedding-3-small",
    )
    vector_store = InMemoryVectorStore(embedder=embedder)
    document_search = DocumentSearch(
        vector_store=vector_store,
    )

    # Ingest all .txt files from the "biographies" directory
    await document_search.ingest("local://biographies/*.txt")

    # Search the documents for the query
    results = await document_search.search("When was Marie Curie-Sklodowska born?")
    print(results)


if __name__ == "__main__":
    asyncio.run(main())
```

## Documentation
* [Quickstart 2: Adding RAG Capabilities](https://ragbits.deepsense.ai/quickstart/quickstart2_rag/)
* [How-To Guides - Document Search](https://ragbits.deepsense.ai/how-to/document_search/async_processing/)
* [API Reference - Document Search](https://ragbits.deepsense.ai/api_reference/document_search/)
