Metadata-Version: 2.4
Name: docu-fluent
Version: 0.0.1
Summary: A powerful Word document translation SDK with multi-model workflow.
Author-email: chenhao <twch97780@gmail.com>
License: CC BY-NC-SA 4.0
License-File: LICENSE
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Requires-Python: >=3.11
Requires-Dist: gradio>=4.26.0
Requires-Dist: loguru>=0.7.3
Requires-Dist: mammoth>=1.6.0
Requires-Dist: openai>=2.8.1
Requires-Dist: openpyxl>=3.1.5
Requires-Dist: pandas>=2.3.3
Requires-Dist: pydantic>=2.12.5
Requires-Dist: python-docx>=1.2.0
Requires-Dist: reportlab>=4.4.5
Requires-Dist: requests>=2.32.3
Requires-Dist: tenacity>=9.1.2
Description-Content-Type: text/markdown

# Word Document Translation SDK

[English](README.md) | [中文](README_zh.md)

A Python SDK for translating Word documents (`.docx`) using a multi-model workflow (Translate -> Evaluate -> Optimize -> Select).

## Features

- **Multi-Model Workflow**: 
    - **Initial Translation Model**: Initial Translation
    - **Evaluation Model**: Evaluation (Accuracy, Fluency, Consistency, Terminology, Completeness)
    - **Optimization Model**: Optimization based on evaluation
    - **Selection**: Automatically selects the best translation based on scores.
- **Format Preservation**: Preserves paragraph styles, tables, and formulas.
- **Strict Language Enforcement**: Prevents accidental translation to English and ensures strict adherence to the target language.
- **Model Number Preservation**: Automatically detects and preserves alphanumeric codes and model numbers (e.g., "STR-1650").
- **Comprehensive Reporting**: Generates Excel and PDF reports with detailed evaluation metrics.
- **Bilingual Output**: Generates a bilingual document (Original + Translation) with preserved indentation and formatting.

## Workflow Overview

```mermaid
graph TD
    Start["Start: Input Document"] --> Extract["Extract Segments"]
    Extract --> Stage1["Stage 1: Initial Translation (Model A)"]
    Stage1 --> CheckSimple{"Is Simple Segment?"}
    CheckSimple -- Yes --> Skip["Skip Translation"]
    CheckSimple -- No --> CheckCache{"In Cache?"}
    CheckCache -- Yes --> UseCache["Use Cached Translation"]
    CheckCache -- No --> Translate["Call Model A"]
    Translate --> Stage1_5["Stage 1.5: Repair Checks"]
    UseCache --> Stage1_5
    
    Stage1_5 --> CheckFail{"Translation == Original?"}
    CheckFail -- Yes --> Repair["Retry Translation"]
    CheckFail -- No --> Stage2
    Repair --> Stage2["Stage 2: Evaluation 1 (Model B)"]
    Skip --> Stage2
    
    Stage2 --> EvalA["Evaluate Model A"]
    EvalA --> Stage3["Stage 3: Optimization (Model C)"]
    
    Stage3 --> CheckScore{"Score >= 9.5?"}
    CheckScore -- Yes --> SkipOpt["Skip Optimization"]
    CheckScore -- No --> Optimize["Call Model C with Suggestions"]
    
    SkipOpt --> Stage4
    Optimize --> Stage4["Stage 4: Comparative Evaluation (Model B)"]
    
    Stage4 --> EvalComp["Evaluate A vs C"]
    EvalComp --> Stage5["Stage 5: Selection"]
    
    Stage5 --> Select{"Score C > Score A?"}
    Select -- Yes --> FinalC["Select Model C"]
    Select -- No --> FinalA["Select Model A"]
    
    FinalC --> Output["Generate Documents & Reports"]
    FinalA --> Output
    
    style Start fill:#f9f,stroke:#333,stroke-width:2px
    style Output fill:#f9f,stroke:#333,stroke-width:2px
```

## Installation
 
You can install the SDK directly using `pip` or `uv`:
 
```bash
# Using pip
pip install docu-fluent
 
# Using uv
uv pip install docu-fluent
```
 
Alternatively, for development:
 
```bash
# Clone the repository
git clone <repository-url>
cd translation-sdk
 
# Install dependencies
uv sync
```

## Usage

### Command Line Interface (CLI)

You can use the SDK directly from the command line.

```bash
# Set up your API key (if using OpenAI)
export OPENAI_API_KEY="your-api-key"

# Run translation with Azure OpenAI
uv run python -m docu_fluent input.docx \
    --output-dir output \
    --provider azure \
    --base-url https://your-resource.openai.azure.com/ \
    --api-key your-azure-key \
    --api-version 2023-05-15 \
    --model-translation gpt-35-turbo-deployment \
    --model-evaluation gpt-4-deployment \
    --model-optimization gpt-4-deployment \
    --source-lang auto \
    --target-lang "French"
```

**Arguments:**

- `input_file`: Path to the `.docx` file to translate.
- `--output-dir`: Directory to save the output files (default: `output`).
- `--provider`: LLM provider to use (`openai`, `azure`, or `mock`). Default is `mock`.
- `--api-key`: API key for the provider.
- `--base-url`: Base URL (OpenAI) or Azure Endpoint (Azure).
- `--api-version`: API Version (Azure only, e.g., `2023-05-15`).
- `--model-translation`: Model/Deployment for translation.
- `--model-evaluation`: Model/Deployment for evaluation.
- `--model-optimization`: Model/Deployment for optimization.
- `--source-lang`: Source language (default: `auto`).
- `--target-lang`: Target language (default: `Chinese`).
- `--config`: Path to a JSON configuration file (e.g., `model_config.json`). If provided, model arguments are ignored.

### Quick Start with Configuration File

For easier usage, you can configure your models in `model_config.json` and use the `translate.py` script.

1.  **Configure Models**: Create or edit `model_config.json`:
    ```json
    {
        "translation_config": {
            "provider": "openai",
            "api_key": "sk-...",
            "base_url": "https://api.openai.com/v1",
            "model": "gpt-3.5-turbo"
        },
        "evaluation_config": {
            "provider": "openai",
            "api_key": "sk-...",
            "base_url": "https://api.openai.com/v1",
            "model": "gpt-4"
        },
        "optimization_config": {
            "provider": "openai",
            "api_key": "sk-...",
            "base_url": "https://api.openai.com/v1",
            "model": "gpt-4"
        },
        "concurrency_config": {
            "translation": 32,
            "evaluation_1": 32,
            "optimization": 32,
            "evaluation_2": 32
        }
    }
    ```

2.  **Run Translation**:
    ```bash
    uv run python -m docu_fluent input.docx --config model_config.json --target-lang "Chinese"
    ```

### Web Interface (GUI)

You can launch a user-friendly web interface to run translations without using the command line arguments.

**Launch Command:**
```bash
# Using the test script
uv run python translate.py --gui

# OR using the main SDK module (if installed)
uv run python -m docu_fluent --gui
```

**Features:**
-   **File Upload**: Drag and drop `.docx` files.
-   **Language Selection**: Select source (auto-detect supported) and target languages from a dropdown.
-   **Settings (JSON)**: Directly edit the `model_config.json` content in the "Settings" tab to configure:
    -   **Models**: Provider, API Key, Base URL, and Model Name for each stage (Translation, Evaluation, Optimization).
    -   **Concurrency**: Number of parallel workers for each stage.
-   **Progress Tracking**: Real-time progress bars for translation, evaluation, and optimization stages.
-   **Downloads**: Download all generated files (Translated Doc, Bilingual Doc, Reports, JSON Logs) directly from the browser.

### Python SDK

You can also use the SDK in your Python code.

```python
from docu_fluent.sdk import TranslationSDK

# Initialize SDK with specific configurations for each model
# This allows using different providers/models for each step
# Initialize SDK with specific configurations for each model
# This allows using different providers/models for each step
sdk = TranslationSDK(
    translation_config={
        "provider": "openai",
        "api_key": "key-for-provider-1",
        "base_url": "https://api.provider1.com/v1",
        "model": "model-name-1"
    },
    evaluation_config={
        "provider": "openai",
        "api_key": "key-for-provider-2",
        "base_url": "https://api.provider2.com/v1",
        "model": "model-name-2"
    },
    optimization_config={
        "provider": "azure",
        "api_key": "azure-key",
        "base_url": "https://your-resource.openai.azure.com/",
        "api_version": "2023-05-15",
        "model": "gpt-4-deployment"
    }
)

# Translate a document
sdk.translate_document(
    "path/to/document.docx", 
    output_dir="output",
    source_lang="English",
    target_lang="Spanish"
)
```

### Regenerating Documents

If you need to make manual corrections, you can edit the generated Excel report (`{filename}_report.xlsx`) and then regenerate the documents.

1.  Open the Excel report.
2.  (Optional) Add a column named `final_translation` with your corrected text.
3.  Run the regeneration script:

```bash
uv run python regenerate_docs.py \
    --input-docx "path/to/original.docx" \
    --input-excel "path/to/edited_report.xlsx" \
    --output-dir "output_folder"
```

This will generate:
- `{filename}_regenerated_translated.docx`
- `{filename}_regenerated_bilingual.docx`


## Output Files

The SDK generates the following files in the output directory:

1.  `{filename}_translated.docx`: The fully translated document.
2.  `{filename}_bilingual.docx`: A document with both original and translated text.
3.  `{filename}_report.xlsx`: An Excel file containing detailed scores for each segment across 5 dimensions.
4.  `{filename}_report.pdf`: A PDF summary of the translation quality.
5.  `{filename}_usage.json`: Token usage statistics for the translation task.
6.  `{filename}_model_mapping.json`: Mapping of model aliases (A, B, C) to actual model names used.
7.  `{filename}_results.json`: Full detailed results including all intermediate steps and raw LLM responses.

## Evaluation Dimensions

The translation is evaluated on 5 dimensions (0-10 score):
1.  **Accuracy**: How accurately the meaning is conveyed.
2.  **Fluency**: How natural the translation sounds.
3.  **Consistency**: Consistency of terminology and style.
4.  **Terminology**: Accuracy of specific domain terms.
5.  **Completeness**: Whether all content is translated.

## License

This project is licensed under the **GNU Affero General Public License v3.0 (AGPLv3)**.

See the [LICENSE](LICENSE) file for details.
