Metadata-Version: 2.4
Name: opendataloader-pdf-mcp
Version: 0.2.0
Summary: MCP server for OpenDataLoader PDF — convert PDFs to Markdown, JSON, HTML, and more
License-Expression: Apache-2.0
Keywords: ai-agents,llm,mcp,model-context-protocol,pdf,scientific-papers
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Text Processing :: Markup
Requires-Python: >=3.10
Requires-Dist: mcp[cli]<2,>=1.27.0
Requires-Dist: opendataloader-pdf
Description-Content-Type: text/markdown

# OpenDataLoader PDF MCP Server

MCP (Model Context Protocol) server for [OpenDataLoader PDF](https://github.com/opendataloader-project/opendataloader-pdf).

Enables AI agents to convert PDFs to Markdown, JSON, HTML, and more via MCP.

## Prerequisites

- Java 11+
- Python 3.10+

## Installation

```bash
pip install opendataloader-pdf-mcp
```

## Usage

### Claude Desktop

Add to your Claude Desktop config (`claude_desktop_config.json`):

```json
{
  "mcpServers": {
    "opendataloader-pdf": {
      "command": "uvx",
      "args": ["opendataloader-pdf-mcp"]
    }
  }
}
```

### Claude Code

```bash
claude mcp add opendataloader-pdf -- uvx opendataloader-pdf-mcp
```

### OpenAI Codex

```bash
codex --mcp-config mcp.json
```

`mcp.json`:

```json
{
  "mcpServers": {
    "opendataloader-pdf": {
      "command": "uvx",
      "args": ["opendataloader-pdf-mcp"]
    }
  }
}
```

### Cursor

Add to `.cursor/mcp.json` in your project:

```json
{
  "mcpServers": {
    "opendataloader-pdf": {
      "command": "uvx",
      "args": ["opendataloader-pdf-mcp"]
    }
  }
}
```

### Windsurf

Add to `~/.codeium/windsurf/mcp_config.json`:

```json
{
  "mcpServers": {
    "opendataloader-pdf": {
      "command": "uvx",
      "args": ["opendataloader-pdf-mcp"]
    }
  }
}
```

### Other MCP Clients

Any MCP-compatible client can use this server. The command is:

```bash
uvx opendataloader-pdf-mcp
```

## Tools

### convert_pdf

Convert a PDF file to the specified format.

**Parameters:**

- `input_path` (required): Path to the input PDF file
- `format`: Output format — `json`, `text`, `html`, `markdown` (default), `markdown-with-html`, `markdown-with-images`
- `pages`: Pages to extract (e.g., `"1,3,5-7"`)
- `password`: Password for encrypted PDFs
- All other [OpenDataLoader PDF options](https://opendataloader.org/docs/options) are supported

## License

Apache-2.0
