MCP Hub
Back to servers

docalyze

Read and analyze local documents (PDF, Excel, CSV, Word, PowerPoint, images). No API keys needed.

Registry
Updated
Mar 14, 2026

Quick Install

npx -y docalyze-mcp-server

Docalyze MCP Server

An MCP (Model Context Protocol) server that lets AI assistants read and visually analyze local documents — PDFs, Excel spreadsheets, CSV files, Word documents, PowerPoint presentations, and images.

No API keys required. The host AI (GitHub Copilot, Claude, etc.) does all the reasoning directly.

Supported Formats

FormatExtensionsReadVisual
PDF.pdf
Excel.xlsx, .xls
CSV / TSV.csv, .tsv
JSON.json
Word.docx
PowerPoint.pptx
Plain text.txt, .md
Images.png, .jpg, .jpeg, .gif, .bmp, .tiff, .webp

Tools

ToolDescription
list_documentsList files under a directory, filtered by glob pattern
document_infoGet metadata (size, modified date, sheets) for a file
read_documentExtract text content from a document with pagination
visual_evaluate_documentReturn page images inline so the AI can analyze charts, tables, and diagrams

Installation

From VS Code (recommended)

Search for docalyze in the MCP server gallery (Extensions sidebar → MCP tab) and click Install.

From PyPI

pip install docalyze-mcp-server

From npm

npx docalyze-mcp-server

This requires uv or pipx installed — the npm wrapper calls uvx to run the Python package automatically.

Manual setup

Add to your VS Code mcp.json (or settings.json):

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "python",
      "args": ["-m", "docalyze_mcp_server"],
      "env": {
        "PYTHONIOENCODING": "utf-8"
      }
    }
  }
}

Or, if you installed via pip and want to use the entry point:

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "docalyze-mcp-server"
    }
  }
}

Optional Dependencies

The base install handles PDF, Excel, CSV, JSON, and plain text. For additional formats:

# Word documents
pip install docalyze-mcp-server[docx]

# PowerPoint
pip install docalyze-mcp-server[pptx]

# OCR (requires Tesseract installed on your system)
pip install docalyze-mcp-server[ocr]

# Everything
pip install docalyze-mcp-server[all]

Configuration

The server reads documents from a configurable root directory. Set the DOCUMENTS_ROOT environment variable to change it:

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "docalyze-mcp-server",
      "env": {
        "DOCUMENTS_ROOT": "/path/to/your/documents"
      }
    }
  }
}

If not set, it defaults to the directory containing the server script.

License

MIT

Reviews

No reviews yet

Sign in to write a review