Metadata-Version: 2.4
Name: paddleocr-cli
Version: 0.2.1
Summary: A command-line tool for OCR using PaddleOCR AI Studio API
License: MIT
License-File: LICENSE
Keywords: ocr,paddleocr,cli,document,pdf
Author: PaddleOCR
Requires-Python: >=3.10,<4.0
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Programming Language :: Python :: 3.14
Classifier: Topic :: Text Processing
Classifier: Topic :: Utilities
Provides-Extra: yaml
Requires-Dist: pyyaml (>=6.0,<7.0) ; extra == "yaml"
Description-Content-Type: text/markdown

# PaddleOCR CLI

A command-line tool for OCR using PaddleOCR AI Studio API.

Available in both **Go** (native binary) and **Python** versions.

## Installation

### Go Version (Recommended)

#### Download Binary

Download the latest release from [GitHub Releases](https://github.com/Explorer1092/paddleocr_cli/releases).

#### Using Homebrew (macOS/Linux)

```bash
brew install Explorer1092/tap/paddleocr-cli
```

#### Build from Source

```bash
git clone https://github.com/Explorer1092/paddleocr_cli.git
cd paddleocr_cli
go build -o paddleocr-cli ./cmd/paddleocr-cli
```

### Python Version

#### Using pip

```bash
pip install paddleocr-cli
```

#### Using uv

```bash
uv pip install paddleocr-cli
```

#### Using Poetry (Development)

```bash
git clone https://github.com/Explorer1092/paddleocr_cli.git
cd paddleocr_cli
poetry install
```

## Quick Start

### 1. Configure credentials

```bash
# Configure (default saves to user directory)
paddleocr-cli configure --server-url https://your-server.com --token YOUR_TOKEN

# Configure with scope
paddleocr-cli configure --server-url URL --token TOKEN -s project  # project root
paddleocr-cli configure --server-url URL --token TOKEN -s local    # current directory
paddleocr-cli configure --server-url URL --token TOKEN -s user     # ~/.config/ (default)

# Verify configuration
paddleocr-cli configure --show

# Test connection
paddleocr-cli configure --test
```

### 2. Run OCR

```bash
# Output to stdout
paddleocr-cli resume.pdf

# Output to file
paddleocr-cli resume.pdf -o output.md

# Output as JSON
paddleocr-cli resume.pdf --json
```

## Usage

```
paddleocr-cli [OPTIONS] FILE

positional arguments:
  FILE                  Path to PDF or image file

options:
  -o, --output FILE     Output file path (default: stdout)
  --json                Output as JSON instead of markdown
  --page N              Extract only page N (0-indexed)
  --no-separator        Don't add page separators in markdown output
  --timeout SECONDS     Request timeout in seconds (default: 120)
  --orientation         Enable document orientation classification
  --unwarp              Enable document unwarping
  --chart               Enable chart recognition
  -q, --quiet           Suppress progress messages
  --config FILE         Path to config file
```

### Configure subcommand

```
paddleocr-cli configure [OPTIONS]

options:
  --server-url URL      Set the server URL (required)
  --token TOKEN         Set the access token (required)
  -s, --scope SCOPE     Installation scope (default: user)
                        user    - ~/.config/paddleocr_cli/
                        project - project root (alongside .claude/)
                        local   - current directory
  --show                Show current configuration
  --test                Test connection to the server
  --locations           Show config file search locations
```

## Configuration

Configuration is stored in YAML format (`.paddleocr_cli.yaml`). The tool searches for config files in this order:

1. **Current directory**: `./.paddleocr_cli.yaml`
2. **Project root**: `<project>/.paddleocr_cli.yaml` (alongside `.claude/` directory)
3. **User directory**: `~/.config/paddleocr_cli/config.yaml`

### Config file format

```yaml
paddleocr:
  server_url: "https://your-server.aistudio-app.com"
  access_token: "your_token_here"
```

## Python API

You can also use the Python library programmatically:

```python
from paddleocr_cli import PaddleOCRClient, Config, PaddleOCRConfig

# Use default config (from config files)
client = PaddleOCRClient()

# Or provide custom config
config = Config(
    paddleocr=PaddleOCRConfig(
        server_url="https://your-server.com",
        access_token="your_token",
    )
)
client = PaddleOCRClient(config)

# Perform OCR
result = client.ocr_file("document.pdf")

if result.success:
    print(result.full_markdown)
    for page in result.pages:
        print(f"Page {page.page_index}: {len(page.markdown)} chars")
else:
    print(f"Error: {result.error_message}")
```

## Running Python version as a module

```bash
# Using python -m
python -m paddleocr_cli resume.pdf

# Using poetry
poetry run paddleocr-cli resume.pdf

# Using uv
uv run paddleocr-cli resume.pdf
```

## License

MIT License

