Vision MCP

An image analysis server that leverages Vision Language Models via OpenAI-compatible APIs to process and interpret visual data from both URLs and local files.

Tools: 1
Updated: Dec 26, 2025

MCP server for image analysis using Vision Language Models.

Quickstart

  1. Install uv (Python package manager):

    curl -LsSf https://astral.sh/uv/install.sh | sh
    
  2. Configure your MCP client (e.g., Claude Desktop):

In Claude Desktop, go to Claude > Settings > Developer > Edit Config and add the following to claude_desktop_config.json:

{
  "mcpServers": {
    "Vision": {
      "command": "uvx",
      "args": ["vision-mcp"],
      "env": {
        "OPENAI_API_KEY": "your-api-key",
        "OPENAI_API_BASE": "https://api.openai.com",
        "OPENAI_MODEL": "gpt-4o"
      }
    }
  }
}
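
If claude_desktop_config.json already lists other servers, the Vision entry can be merged in rather than pasted over the file. The following is a minimal Python sketch of that merge, not part of this project: the helper name `add_vision_server` and the `other` server entry are hypothetical, and reading or writing the actual file is left to the caller because its location varies by platform.

```python
import json


def add_vision_server(config: dict, api_key: str,
                      api_base: str = "https://api.openai.com",
                      model: str = "gpt-4o") -> dict:
    """Return a copy of `config` with a "Vision" entry under mcpServers.

    Hypothetical helper: the entry mirrors the Quickstart config above.
    """
    # Deep-copy via a JSON round-trip so the caller's dict is not mutated.
    merged = json.loads(json.dumps(config))
    servers = merged.setdefault("mcpServers", {})
    servers["Vision"] = {
        "command": "uvx",
        "args": ["vision-mcp"],
        "env": {
            "OPENAI_API_KEY": api_key,
            "OPENAI_API_BASE": api_base,
            "OPENAI_MODEL": model,
        },
    }
    return merged


# Example: an existing config with one unrelated (made-up) server.
existing = {"mcpServers": {"other": {"command": "npx", "args": ["other-server"]}}}
updated = add_vision_server(existing, "your-api-key")
print(json.dumps(updated, indent=2))
```

The existing entries are preserved and only the "Vision" key is added or replaced.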

Environment Variables

Variable          Required   Description
OPENAI_API_KEY    Yes        API key for authentication
OPENAI_API_BASE   Yes        API base URL
OPENAI_MODEL      Yes        Model name for vision tasks
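
Since all three variables are required, it can help to check them before launching the server. This is a small Python sketch under that assumption; `missing_vars` is a hypothetical helper, and the server itself may report missing settings differently.

```python
import os

# The three variables this README lists as required.
REQUIRED_VARS = ("OPENAI_API_KEY", "OPENAI_API_BASE", "OPENAI_MODEL")


def missing_vars(env=None):
    """Return the required variable names that are unset or empty.

    Hypothetical helper; checks os.environ unless a mapping is passed in.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


# Example with an explicit mapping that lacks the base URL.
example_env = {"OPENAI_API_KEY": "your-api-key", "OPENAI_MODEL": "gpt-4o"}
print(missing_vars(example_env))  # ['OPENAI_API_BASE']
```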

Available Tools

Tool            Description
analyze_image   Analyze images using a Vision Language Model

analyze_image

Analyze and understand image content from files or URLs.

Parameters:

  • prompt (str): The text prompt describing what to analyze
  • image_source (str): Image URL or local file path
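
MCP clients invoke tools through a JSON-RPC `tools/call` request, so a call to this tool with the two parameters above would look roughly like the message built below. This is an illustrative Python sketch: `build_analyze_image_request` is a hypothetical helper and the image URL is a placeholder. In practice the MCP client (for example Claude Desktop) constructs and sends this message for you.

```python
import json


def build_analyze_image_request(prompt: str, image_source: str,
                                request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 `tools/call` message for analyze_image.

    Hypothetical helper for illustration only.
    """
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "analyze_image",
            "arguments": {
                "prompt": prompt,
                "image_source": image_source,
            },
        },
    }


# Placeholder URL; a local file path would go in the same field.
request = build_analyze_image_request(
    "Describe the main objects in this image.",
    "https://example.com/cat.png",
)
print(json.dumps(request, indent=2))
```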

Supported formats: JPEG, PNG, WebP
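
Because `image_source` accepts either a URL or a local path, a caller may want to check the value before sending it. The Python sketch below shows one way to do that by file extension. It is an assumption about client-side validation, not a description of how the server detects formats (it may inspect the file contents instead), and both function names are hypothetical.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions assumed to correspond to JPEG, PNG, and WebP.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp"}


def is_url(image_source: str) -> bool:
    """Treat http(s) values with a host as URLs; anything else as a path."""
    parsed = urlparse(image_source)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def has_supported_extension(image_source: str) -> bool:
    """Check the file extension, ignoring any URL query string."""
    path = urlparse(image_source).path if is_url(image_source) else image_source
    return PurePosixPath(path).suffix.lower() in SUPPORTED_EXTENSIONS


for source in ("https://example.com/cat.webp?size=large", "./photo.JPG", "diagram.gif"):
    print(source, is_url(source), has_supported_extension(source))
```

An extension check is only a quick pre-filter: a URL without a file extension may still point to a valid image.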

License

MIT

Acknowledgments

This project is inspired by MiniMax-Coding-Plan-MCP by MiniMax AI.
