Metadata-Version: 2.4
Name: llama-index-readers-remote
Version: 0.4.1
Summary: llama-index readers remote integration
Author-email: Your Name <you@example.com>
Maintainer: thejessezhang
License-Expression: MIT
License-File: LICENSE
Keywords: gutenberg,hosted,url
Requires-Python: <4.0,>=3.9
Requires-Dist: llama-index-core<0.15,>=0.13.0
Requires-Dist: llama-index-readers-youtube-transcript<0.5,>=0.4.0
Description-Content-Type: text/markdown

# Remote Page/File Loader

```bash
pip install llama-index-readers-remote
```

This loader makes it easy to extract the text from any remote page or file using just its url. If there's a file at the url, this loader will download it temporarily and parse it using `SimpleDirectoryReader`. It is an all-in-one tool for (almost) any url.

As a result, any page or type of file is supported. For instance, if a `.txt` url such as a [Project Gutenberg book](https://www.gutenberg.org/cache/epub/69994/pg69994.txt) is passed in, the text will be parsed as is. On the other hand, if a hosted .mp3 url is passed in, it will be downloaded and parsed using `AudioTranscriber`.

## Usage

To use this loader, you need to pass in a `Path` to a local file. Optionally, you may specify a `file_extractor` for the `SimpleDirectoryReader` to use, other than the default one.

```python
from llama_index.readers.remote import RemoteReader

loader = RemoteReader()
documents = loader.load_data(
    url="https://en.wikipedia.org/wiki/File:Example.jpg"
)
```

This loader is designed to be used as a way to load data into [LlamaIndex](https://github.com/run-llama/llama_index/).
