Metadata-Version: 2.4
Name: docx-mcp-server
Version: 0.2.1
Summary: MCP server for reading and editing Word (.docx) documents with track changes, comments, footnotes, tables, images, sections, and structural validation
Project-URL: Homepage, https://github.com/SecurityRonin/docx-mcp
Project-URL: Repository, https://github.com/SecurityRonin/docx-mcp
Project-URL: Issues, https://github.com/SecurityRonin/docx-mcp/issues
Author: SecurityRonin
License-Expression: MIT
License-File: LICENSE
Keywords: comments,docx,footnotes,mcp,ooxml,track-changes,word
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Text Processing :: Markup :: XML
Requires-Python: >=3.10
Requires-Dist: lxml>=4.9.0
Requires-Dist: mcp>=1.0.0
Description-Content-Type: text/markdown

# docx-mcp

[![PyPI](https://img.shields.io/pypi/v/docx-mcp-server?color=blue)](https://pypi.org/project/docx-mcp-server/)
[![Python](https://img.shields.io/pypi/pyversions/docx-mcp-server)](https://pypi.org/project/docx-mcp-server/)
[![License: MIT](https://img.shields.io/badge/License-MIT-green.svg)](https://opensource.org/licenses/MIT)
[![CI](https://github.com/SecurityRonin/docx-mcp/actions/workflows/ci.yml/badge.svg)](https://github.com/SecurityRonin/docx-mcp/actions/workflows/ci.yml)
[![Coverage](https://img.shields.io/badge/coverage-100%25-brightgreen)](https://github.com/SecurityRonin/docx-mcp)
[![Sponsor](https://img.shields.io/github/sponsors/h4x0r?logo=githubsponsors&label=Sponsor&color=%23ea4aaa)](https://github.com/sponsors/h4x0r)

<a href="https://glama.ai/mcp/servers/SecurityRonin/docx-mcp">
  <img width="380" height="200" src="https://glama.ai/mcp/servers/SecurityRonin/docx-mcp/badges/card.svg" alt="docx-mcp MCP server" />
</a>

MCP server for reading and editing Word (.docx) documents with track changes, comments, footnotes, tables, images, sections, and structural validation.

The only cross-platform MCP server that combines **track changes**, **comments**, **footnotes**, **tables**, **formatting**, **images**, **sections**, **cross-references**, **document merge**, and **protection** in a single package — with OOXML-level structural validation that no other server offers.

## Features

| Capability | Description |
|---|---|
| **Track changes** | Insert/delete text with proper `w:ins`/`w:del` markup — shows as revisions in Word |
| **Accept/reject changes** | Accept or reject tracked changes (all or by author) |
| **Character formatting** | Bold, italic, underline, color — with tracked-change markup |
| **Comments** | Add comments, reply to threads, read existing comments |
| **Footnotes & endnotes** | Add, list, and validate cross-references for both |
| **Tables** | Create tables, modify cells, add/delete rows — all with tracked changes |
| **Lists** | Apply bullet or numbered list formatting to paragraphs |
| **Images** | List embedded images, insert new images with dimensions |
| **Headers/footers** | Read and edit header/footer content with tracked changes |
| **Styles & properties** | Read styles, get/set document properties (title, creator, etc.) |
| **Sections & page breaks** | Insert page/section breaks, set page size/orientation/margins |
| **Cross-references** | Add internal hyperlinks between paragraphs with bookmarks |
| **Document merge** | Merge content from another DOCX with automatic paraId remapping |
| **Document protection** | Set tracked-changes/read-only/comments protection with SHA-512 passwords |
| **Structural audit** | Validate footnotes, endnotes, paraIds, headings, bookmarks, tables, images, protection |
| **Watermark removal** | Detect and remove VML watermarks (e.g., DRAFT) from headers |

## Installation

```bash
# Register MCP server with Claude Code
claude mcp add docx-mcp -- uvx docx-mcp-server

# Install the companion skill (teaches Claude when and how to use the tools)
uvx --from docx-mcp-server docx-mcp install-skill
```

The skill ships inside the package. After upgrading, run `uvx --from docx-mcp-server docx-mcp update-skill` to refresh it.

<details>
<summary>Other installation methods</summary>

```bash
# With pip
pip install docx-mcp-server
docx-mcp install-skill

# With uvx (standalone, no pip install)
uvx --from docx-mcp-server docx-mcp install-skill
```

</details>

### Upgrading

```bash
# uvx users — uvx always fetches the latest, just refresh the skill
uvx --from docx-mcp-server docx-mcp update-skill

# pip users
pip install --upgrade docx-mcp-server
docx-mcp update-skill
```

## Configuration

### Claude Desktop / Claude Code

Add to your MCP settings:

```json
{
  "mcpServers": {
    "docx-mcp": {
      "command": "uvx",
      "args": ["docx-mcp-server"]
    }
  }
}
```

### Cursor / Windsurf / VS Code

Add to your MCP configuration file:

```json
{
  "mcpServers": {
    "docx-mcp": {
      "command": "uvx",
      "args": ["docx-mcp-server"]
    }
  }
}
```

### OpenClaw

Add to your `openclaw.yaml`:

```yaml
mcpServers:
  docx-mcp:
    command: uvx
    args:
      - docx-mcp-server
```

Or via the CLI:

```bash
openclaw config set mcpServers.docx-mcp.command "uvx"
openclaw config set mcpServers.docx-mcp.args '["docx-mcp-server"]'
```

### With pip install

```json
{
  "mcpServers": {
    "docx-mcp": {
      "command": "docx-mcp"
    }
  }
}
```

## Available Tools (43)

### Document Lifecycle

| Tool | Description |
|---|---|
| `open_document` | Open a .docx file for reading and editing |
| `close_document` | Close the current document and clean up |
| `get_document_info` | Get overview stats (paragraphs, headings, footnotes, comments) |
| `save_document` | Save changes back to .docx (can overwrite or save to new path) |

### Reading

| Tool | Description |
|---|---|
| `get_headings` | Get heading structure with levels, text, styles, and paraIds |
| `search_text` | Search across body, footnotes, and comments (text or regex) |
| `get_paragraph` | Get full text and style of a paragraph by paraId |

### Track Changes

| Tool | Description |
|---|---|
| `insert_text` | Insert text with tracked-change markup (`w:ins`) |
| `delete_text` | Mark text as deleted with tracked-change markup (`w:del`) |
| `accept_changes` | Accept tracked changes (all or by author) |
| `reject_changes` | Reject tracked changes (all or by author) |
| `set_formatting` | Apply bold/italic/underline/color with tracked-change markup |

### Tables

| Tool | Description |
|---|---|
| `get_tables` | Get all tables with row/column counts and cell content |
| `add_table` | Insert a new table after a paragraph with tracked insertion |
| `modify_cell` | Modify a table cell with tracked changes |
| `add_table_row` | Add a row to a table with tracked insertion |
| `delete_table_row` | Delete a table row with tracked changes |

### Lists

| Tool | Description |
|---|---|
| `add_list` | Apply bullet or numbered list formatting to paragraphs |

### Comments

| Tool | Description |
|---|---|
| `get_comments` | List all comments with ID, author, date, and text |
| `add_comment` | Add a comment anchored to a paragraph |
| `reply_to_comment` | Reply to an existing comment (threaded) |

### Footnotes & Endnotes

| Tool | Description |
|---|---|
| `get_footnotes` | List all footnotes with ID and text |
| `add_footnote` | Add a footnote with superscript reference |
| `validate_footnotes` | Cross-reference footnote IDs between body and footnotes.xml |
| `get_endnotes` | List all endnotes with ID and text |
| `add_endnote` | Add an endnote with superscript reference |
| `validate_endnotes` | Cross-reference endnote IDs between body and endnotes.xml |

### Headers, Footers & Styles

| Tool | Description |
|---|---|
| `get_headers_footers` | Get all headers and footers with text content |
| `edit_header_footer` | Edit header/footer text with tracked changes |
| `get_styles` | Get all defined styles with ID, name, type, and base style |

### Properties & Images

| Tool | Description |
|---|---|
| `get_properties` | Get core document properties (title, creator, dates, revision) |
| `set_properties` | Set core document properties (title, creator, subject, description) |
| `get_images` | Get all embedded images with rId, filename, content type, dimensions |
| `insert_image` | Insert an image after a paragraph with specified dimensions |

### Sections & Cross-References

| Tool | Description |
|---|---|
| `add_page_break` | Insert a page break after a paragraph |
| `add_section_break` | Add a section break (nextPage, continuous, evenPage, oddPage) |
| `set_section_properties` | Set page size, orientation, and margins for a section |
| `add_cross_reference` | Add a cross-reference link between paragraphs with bookmarks |

### Protection & Merge

| Tool | Description |
|---|---|
| `set_document_protection` | Set document protection (trackedChanges, readOnly, comments, forms) |
| `merge_documents` | Merge content from another DOCX with paraId remapping |

### Validation & Audit

| Tool | Description |
|---|---|
| `validate_paraids` | Check paraId uniqueness and range validity across all parts |
| `remove_watermark` | Remove VML watermarks from document headers |
| `audit_document` | Comprehensive structural audit (footnotes, endnotes, paraIds, headings, bookmarks, tables, relationships, images, protection, artifacts) |

## Example Workflow

```
1. open_document("/path/to/contract.docx")
2. get_headings()                          → see document structure
3. search_text("30 days")                  → find the clause
4. delete_text(para_id, "30 days")         → tracked deletion
5. insert_text(para_id, "60 days")         → tracked insertion
6. add_comment(para_id, "Extended per client request")
7. audit_document()                        → verify structural integrity
8. save_document("/path/to/contract_revised.docx")
```

The resulting document opens in Microsoft Word with proper revision marks — deletions shown as red strikethrough, insertions as green underline, comments in the sidebar.

## How It Works

A .docx file is a ZIP archive of XML files. This server:

1. **Unpacks** the archive to a temporary directory
2. **Parses** all XML parts with lxml and caches them in memory
3. **Edits** the cached DOM trees directly (no intermediate abstraction layer)
4. **Repacks** modified XML back into a valid .docx archive

This approach gives full control over OOXML markup — essential for track changes (`w:ins`/`w:del`), comments (`w:comment` + range markers), and structural validation that higher-level libraries like python-docx don't expose.

## Requirements

- Python 3.10+
- lxml

## License

MIT

<img referrerpolicy="no-referrer-when-downgrade" src="https://static.scarf.sh/a.png?x-pxid=95beebbb-0f2e-46cc-9a68-a8e66f613180" />
