Metadata-Version: 2.4
Name: llama-index-readers-markitdown
Version: 0.2.1
Summary: llama-index readers MarkItDown integration
Author-email: Clelia Astra Bertelli <astraberte9@gmail.com>
Maintainer: AstraBert
License-Expression: MIT
License-File: LICENSE
Requires-Python: <4.0,>=3.10
Requires-Dist: llama-index-core<0.15,>=0.13.0
Requires-Dist: markitdown[all]==0.1.0
Description-Content-Type: text/markdown

# LlamaIndex MarkItDown Reader Integration

[MarkItDown](https://github.com/microsoft/markitdown) is a Microsoft tool that converts a range of file formats to Markdown.

`llama-index-readers-markitdown` wraps MarkItDown as a LlamaIndex reader and extracts text from the following file formats:

- .txt files and text-based files without extension
- .csv, .xml and .json files
- HTML files (.html)
- Presentations (.pptx)
- Word documents (.docx)
- PDF documents (.pdf)
- ZIP files (.zip)

Install it with pip:

```bash
pip install llama-index-readers-markitdown
```

Then use it in a script as follows:

```python
from llama_index.readers.markitdown import MarkItDownReader

reader = MarkItDownReader()
documents = reader.load_data("presentation.pptx")
```
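
To load a whole folder, one option is to collect the files whose extensions appear in the supported-formats list and pass them to the reader one at a time. The sketch below is a minimal illustration, not part of the package: `SUPPORTED_SUFFIXES`, `supported_files` and `load_supported` are hypothetical names, the suffix set is copied by hand from the list above, and extensionless text files are skipped for simplicity. It relies only on the `load_data(path)` call shown in the previous example.

```python
from pathlib import Path

# Suffixes copied by hand from the supported-formats list above.
# This is an assumption of the sketch, not a value read from the library.
SUPPORTED_SUFFIXES = {
    ".txt", ".csv", ".xml", ".json", ".html",
    ".pptx", ".docx", ".pdf", ".zip",
}


def supported_files(directory):
    """Hypothetical helper: list files under `directory` with a supported suffix."""
    root = Path(directory)
    return sorted(
        path
        for path in root.rglob("*")
        if path.is_file() and path.suffix.lower() in SUPPORTED_SUFFIXES
    )


def load_supported(directory):
    """Hypothetical helper: load every supported file under `directory`."""
    # Imported lazily so that supported_files() works without the package installed.
    from llama_index.readers.markitdown import MarkItDownReader

    reader = MarkItDownReader()
    documents = []
    for path in supported_files(directory):
        # Same call as in the example above, one file per call.
        documents.extend(reader.load_data(str(path)))
    return documents
```

Calling `load_supported("docs/")` would return a single list of documents for everything the filter matched; files with other extensions are silently ignored, so adjust the suffix set if your installed MarkItDown version handles more formats.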
