|
8 | 8 |
|
9 | 9 | > **Document Operations MCP Server** - A universal MCP server for document processing, conversion, and automation. Handle PDF, DOCX, HTML, Markdown, and more through a unified API and toolset. |
10 | 10 |
|
| 11 | +## Demo |
| 12 | + |
| 13 | +### Video |
| 14 | + |
| 15 | +<https://github-production-user-asset-6210df.s3.amazonaws.com/83206877/478999982-463aba05-bf7e-4d18-a828-f9898c743787.mp4?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAVCODYLSA53PQK4ZA%2F20250818%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20250818T104249Z&X-Amz-Expires=300&X-Amz-Signature=cea37a894e1442764e7aa165c7c40e080e4935435c159327af7f882eadbad128&X-Amz-SignedHeaders=host> |
| 16 | + |
| 17 | +In this demo, we showcase how to: |
| 18 | + |
| 19 | +- Configure doc-ops-mcp in MCP clients |
| 20 | +- Convert Markdown documents to HTML format |
| 21 | +- Convert the resulting HTML to PDF documents |
| 22 | + |
11 | 23 | ## Table of Contents |
12 | 24 |
|
13 | 25 | 1. [Quick Start](#1-quick-start) |
|
22 | 34 | 10. [Troubleshooting](#10-troubleshooting) |
23 | 35 | 11. [Contributing](#11-contributing) |
24 | 36 |
|
25 | | -## 1. Getting Started |
| 37 | +## 1. Quick Start |
26 | 38 |
|
27 | 39 | First, add the Document Operations MCP server to your MCP client. |
28 | 40 |
|
@@ -185,13 +197,17 @@ Document Operations MCP Server adopts a pure JavaScript architecture design, pro |
185 | 197 | ┌───────────────────────────┴─────────────────────────────────┐ |
186 | 198 | │ Core Dependencies Layer │ |
187 | 199 | │ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ │ |
188 | | -│ │ pdf-lib │ │ mammoth │ │ marked │ │ |
189 | | -│ │ (PDF Tools) │ │(DOCX Tools) │ │ (Markdown) │ │ |
| 200 | +│ │ pdf-lib │ │word-extractor│ │ marked │ │ |
| 201 | +│ │ (PDF Tools) │ │(DOCX Reader)│ │ (Markdown) │ │ |
190 | 202 | │ └─────────────┘ └─────────────┘ └─────────────┘ │ |
191 | 203 | │ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ │ |
192 | | -│ │ cheerio │ │ turndown │ │ docx │ │ |
193 | | -│ │(HTML Parser)│ │(HTML to MD) │ │(DOCX Gen.) │ │ |
| 204 | +│ │ cheerio │ │ jszip │ │ docx │ │ |
| 205 | +│ │(HTML Parser)│ │(ZIP Handler)│ │(DOCX Gen.) │ │ |
194 | 206 | │ └─────────────┘ └─────────────┘ └─────────────┘ │ |
| 207 | +│ ┌─────────────┐ ┌─────────────┐ │ |
| 208 | +│ │ xml2js │ │Custom OOXML │ │ |
| 209 | +│ │(XML Parser) │ │ Parser │ │ |
| 210 | +│ └─────────────┘ └─────────────┘ │ |
195 | 211 | └─────────────────────────────────────────────────────────────┘ |
196 | 212 | ``` |
197 | 213 |
|
@@ -390,11 +406,13 @@ Convert HTML to Markdown. |
390 | 406 |
|
391 | 407 | ### Core Technology Stack |
392 | 408 | - **pdf-lib** - PDF operations and enhancement |
393 | | -- **mammoth** - DOCX document processing |
| 409 | +- **word-extractor** - DOCX document text extraction |
394 | 410 | - **marked** - Markdown parsing and rendering |
395 | 411 | - **cheerio** - HTML parsing and manipulation |
396 | | -- **turndown** - HTML to Markdown conversion |
397 | 412 | - **docx** - DOCX document generation |
| 413 | +- **jszip** - ZIP file processing |
| 414 | +- **xml2js** - XML parsing and conversion |
| 415 | +- **Custom OOXML Parser** - Advanced DOCX style preservation |
398 | 416 |
|
399 | 417 | ### Installation |
400 | 418 | ```bash |
@@ -427,11 +445,12 @@ bun add -g doc-ops-mcp |
427 | 445 | | Library | Version | License | Purpose | |
428 | 446 | |---------|---------|---------|----------| |
429 | 447 | | **pdf-lib** | ^1.17.1 | MIT | PDF document manipulation | |
430 | | -| **mammoth** | ^1.6.0 | BSD-2-Clause | DOCX parsing and conversion | |
431 | | -| **marked** | ^9.1.6 | MIT | Markdown parsing and rendering | |
432 | | -| **exceljs** | ^4.4.0 | MIT | Excel file processing | |
433 | | -| **jsdom** | ^23.0.1 | MIT | HTML DOM manipulation | |
434 | | -| **turndown** | ^7.1.2 | MIT | HTML to Markdown conversion | |
| 448 | +| **word-extractor** | ^1.0.4 | MIT | DOCX document text extraction | |
| 449 | +| **marked** | ^15.0.12 | MIT | Markdown parsing and rendering | |
| 450 | +| **cheerio** | ^1.0.0-rc.12 | MIT | HTML parsing and manipulation | |
| 451 | +| **docx** | ^9.5.1 | Apache-2.0 | DOCX document generation | |
| 452 | +| **jszip** | ^3.10.1 | MIT | ZIP file processing | |
| 453 | +| **xml2js** | ^0.6.2 | MIT | XML parsing and conversion | |
435 | 454 |
|
436 | 455 | ### License Compatibility |
437 | 456 | - ✅ **Commercial Use**: All dependencies support commercial use |
|
0 commit comments