Metadata-Version: 2.4
Name: mcp-webextrator
Version: 2026.7.1.0
Summary: MCP Server for WebExtrator Web Render & Extract via AceDataCloud API
Project-URL: Homepage, https://github.com/AceDataCloud/WebExtraterMCP
Project-URL: Documentation, https://platform.acedata.cloud/documents/webextrator
Project-URL: Repository, https://github.com/AceDataCloud/WebExtraterMCP
Project-URL: Issues, https://github.com/AceDataCloud/WebExtraterMCP/issues
Author-email: AceDataCloud <support@acedata.cloud>
Maintainer-email: AceDataCloud <support@acedata.cloud>
License: MIT
License-File: LICENSE
Keywords: acedata,api,mcp,model-context-protocol,scraping,web-extraction,web-rendering,webextrator
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Internet :: WWW/HTTP
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Requires-Python: >=3.10
Requires-Dist: httpx>=0.27.0
Requires-Dist: loguru>=0.7.0
Requires-Dist: mcp>=1.2.0
Requires-Dist: pydantic>=2.0.0
Requires-Dist: python-dotenv>=1.0.0
Provides-Extra: all
Requires-Dist: build>=1.2.0; extra == 'all'
Requires-Dist: mypy>=1.10.0; extra == 'all'
Requires-Dist: pre-commit>=3.7.0; extra == 'all'
Requires-Dist: pytest-asyncio>=0.23.0; extra == 'all'
Requires-Dist: pytest-cov>=5.0.0; extra == 'all'
Requires-Dist: pytest>=8.0.0; extra == 'all'
Requires-Dist: respx>=0.21.0; extra == 'all'
Requires-Dist: ruff>=0.4.0; extra == 'all'
Requires-Dist: twine>=5.0.0; extra == 'all'
Provides-Extra: dev
Requires-Dist: mypy>=1.10.0; extra == 'dev'
Requires-Dist: pre-commit>=3.7.0; extra == 'dev'
Requires-Dist: ruff>=0.4.0; extra == 'dev'
Provides-Extra: release
Requires-Dist: build>=1.2.0; extra == 'release'
Requires-Dist: twine>=5.0.0; extra == 'release'
Provides-Extra: test
Requires-Dist: pytest-asyncio>=0.23.0; extra == 'test'
Requires-Dist: pytest-cov>=5.0.0; extra == 'test'
Requires-Dist: pytest>=8.0.0; extra == 'test'
Requires-Dist: respx>=0.21.0; extra == 'test'
Description-Content-Type: text/markdown

# MCP WebExtrator Server

A Model Context Protocol (MCP) server for web rendering and structured content extraction
via the AceDataCloud WebExtrator platform.

## Features

- **Structured extraction**: Pull structured data out of any URL via WebExtrator
- **Web rendering**: Render dynamic JavaScript pages and capture the rendered output
- **Asynchronous tasks**: Submit extract / render jobs and poll for results
- **Batch task lookup**: Query multiple task results in one call

## Installation

```bash
pip install mcp-webextrator
```

## Configuration

Set your AceDataCloud API token:

```bash
export ACEDATACLOUD_API_TOKEN=your_token_here
```

Get your token from [https://platform.acedata.cloud](https://platform.acedata.cloud).

## Usage

### stdio mode (default)

```bash
mcp-webextrator
```

### HTTP mode

```bash
mcp-webextrator --transport http --port 8000
```

## Available Tools

| Tool | Description |
|------|-------------|
| `webextrator_extract` | Extract structured content from a URL |
| `webextrator_render` | Render a dynamic web page and return the rendered output |
| `webextrator_get_task` | Get the status / result of an extract or render task |
| `webextrator_get_tasks_batch` | Batch-fetch the status / result of multiple tasks |
| `webextrator_get_usage_guide` | Get the API usage guide |

## License

MIT — see [LICENSE](LICENSE).
