MCP HydroCoder Image 🍌

AI image generation and editing MCP server for Cursor, Claude Code, Codex, and any MCP-compatible tool — powered by Nano Banana 2 and Nano Banana Pro (Google Gemini).

An MCP server that turns simple text prompts into high-quality images. Unlike a simple API wrapper, this server automatically enhances your prompt and configures sensible defaults for generation — you don't need to learn prompt engineering or tune settings. Just describe what you want.

How It Works

You: "cat on a roof"
        ↓
  Your AI assistant infers context
  (purpose, style, mood, resolution...)
        ↓
  MCP optimizes your prompt
  (adds lighting, composition, atmosphere, artistic details)
        ↓
  Image generation with smart defaults
  (grounding, consistency, resolution — all configured automatically)
        ↓
  High-quality image, zero effort

Your AI assistant interprets your intent — the style, purpose, and context behind your request. The MCP focuses on output quality by refining the prompt to meet a structured visual clarity standard and selecting appropriate generation settings. You just describe what you want.

The prompt optimizer uses a Subject–Context–Style framework (powered by Gemini 2.5 Flash) to fill in missing visual details — subject characteristics, environment, lighting, camera work — while preserving your original intent. It doesn't blindly add details: prompts that already meet the quality standard are left largely intact.

Example — what the optimizer does to a short prompt:

Input: "cat on a roof"

After optimization: "A sleek, midnight black cat, perched with poised elegance on the apex of a weathered, terracotta tile roof. Its emerald eyes, narrowed slightly, reflect the warm glow of a setting sun. Each individual tile is distinct, showing subtle variations in color and texture, with patches of moss clinging to the crevices. The cat's fur is sharply defined, catching the golden hour light, highlighting its sleek contours. In the background, the silhouettes of distant, old-world city buildings with ornate spires are softly blurred, bathed in a gradient of fiery orange, soft pink, and deep violet twilight. A gentle, ethereal mist begins to rise from the alleyways below, adding a touch of mystery. The composition is a medium shot, taken from a slightly low angle, emphasizing the cat's commanding presence against the vast sky. Photorealistic style, captured with a prime lens, wide aperture to create a beautiful bokeh, enhancing the depth of field."

Features

Built-in Prompt Optimization: Your simple prompt is automatically enriched with photographic and artistic details — lighting, composition, atmosphere — using Gemini 2.5 Flash. No prompt engineering skills required.
Three Quality Tiers: Choose between fast iteration, balanced quality, or maximum fidelity with Nano Banana 2 (Gemini 3.1 Flash Image) and Nano Banana Pro (Gemini 3 Pro Image). See Quality Presets.
Image Editing: Transform existing images with natural language instructions (image-to-image) while preserving original style and visual consistency.
High-Resolution Output: Up to 4K image generation for professional-grade output with superior text rendering and fine details.
Flexible Aspect Ratios: From square (1:1) to ultra-wide (21:9) and ultra-tall (1:8) formats.
Character Consistency: Maintain consistent character appearance across multiple generations — ideal for storyboards, product shots, and visual series.
Advanced Capabilities:
- Google Search grounding for real-time factual accuracy
- World knowledge for photorealistic depictions of historical figures, landmarks, and factual scenarios
- Multi-image blending for composite scenes
- Purpose-aware generation (e.g., "cookbook cover" produces different results than "social media post")
Multiple Output Formats: PNG, JPEG, WebP support.

Agent Skill: Image Generation Prompt Guide

This project also provides a standalone Agent Skill (SKILL.md) that teaches AI assistants to write better image generation prompts — no MCP server or API key required.

Note: This skill does not generate images itself. It teaches your AI assistant to write better prompts for tools that already have built-in image generation (e.g., Cursor's native image generation).

Based on the Subject-Context-Style framework, covering prompt structure, visual details (lighting, textures, camera angles), advanced techniques (character consistency, composition), and image editing. Works with any image model (Gemini, GPT Image, Flux, Stable Diffusion, Midjourney, etc.).

Install

npx mcp-hydrocoder-image skills install --path <target-directory>

The skill will be placed at <path>/image-generation/SKILL.md. Specify the skills directory for your AI tool:

# Cursor
npx mcp-hydrocoder-image skills install --path ~/.cursor/skills

# Codex
npx mcp-hydrocoder-image skills install --path ~/.codex/skills

# Claude Code
npx mcp-hydrocoder-image skills install --path ~/.claude/skills

When to Use the Skill vs the MCP Server

	MCP Server	Agent Skill
Use when	Your AI tool does not have built-in image generation	Your AI tool already generates images natively
Requires	Gemini API key	Nothing
What it does	Generates images via Gemini API with automatic prompt optimization	Teaches the AI to write better prompts
Works with	MCP-compatible tools (Cursor, Claude Code, Codex, etc.)	Any tool supporting the Agent Skills open standard

Prerequisites

Node.js 20 or higher
Gemini API Key - Get yours at Google AI Studio
An MCP-compatible AI tool: Cursor, Claude Code, Codex, or others
Basic terminal/command line knowledge

Quick Start

1. Get Your Gemini API Key

Get your API key from Google AI Studio

2. MCP Configuration

For Codex

Add to ~/.codex/config.toml:

[mcp_servers.mcp-hydrocoder-image]
command = "npx"
args = ["-y", "mcp-hydrocoder-image"]

[mcp_servers.mcp-hydrocoder-image.env]
GEMINI_API_KEY = "your_gemini_api_key_here"
IMAGE_OUTPUT_DIR = "/absolute/path/to/images"

For Cursor

Add to your Cursor settings:

Global (all projects): ~/.cursor/mcp.json
Project-specific: .cursor/mcp.json in your project root

macOS / Linux:

{
  "mcpServers": {
    "mcp-hydrocoder-image": {
      "command": "npx",
      "args": ["-y", "mcp-hydrocoder-image"],
      "env": {
        "GEMINI_API_KEY": "your_gemini_api_key_here",
        "IMAGE_OUTPUT_DIR": "/absolute/path/to/images"
      }
    }
  }
}

Windows:

{
  "mcpServers": {
    "mcp-hydrocoder-image": {
      "command": "npx",
      "args": ["-y", "mcp-hydrocoder-image"],
      "env": {
        "GEMINI_API_KEY": "your_gemini_api_key_here",
        "IMAGE_OUTPUT_DIR": "C:\\absolute\\path\\to\\images"
      }
    }
  }
}

For Claude Code

Run in your project directory to enable for that project:

cd /path/to/your/project
claude mcp add mcp-hydrocoder-image --env GEMINI_API_KEY=your-api-key --env IMAGE_OUTPUT_DIR=/absolute/path/to/images -- npx -y mcp-hydrocoder-image

Or add globally for all projects:

claude mcp add mcp-hydrocoder-image --scope user --env GEMINI_API_KEY=your-api-key --env IMAGE_OUTPUT_DIR=/absolute/path/to/images -- npx -y mcp-hydrocoder-image

Or add via JSON config (~/.claude/settings.json for global, .mcp.json for project):

macOS / Linux:

{
  "mcpServers": {
    "mcp-hydrocoder-image": {
      "command": "npx",
      "args": ["-y", "mcp-hydrocoder-image"],
      "env": {
        "GEMINI_API_KEY": "your_gemini_api_key_here",
        "IMAGE_OUTPUT_DIR": "/absolute/path/to/images"
      }
    }
  }
}

Windows:

{
  "mcpServers": {
    "mcp-hydrocoder-image": {
      "command": "npx",
      "args": ["-y", "mcp-hydrocoder-image"],
      "env": {
        "GEMINI_API_KEY": "your_gemini_api_key_here",
        "IMAGE_OUTPUT_DIR": "C:\\absolute\\path\\to\\images"
      }
    }
  }
}

⚠️ Security Note: Never commit your API key to version control. Keep it secure and use environment-specific configuration.

📁 Path Requirements:

IMAGE_OUTPUT_DIR must be an absolute path (e.g., /Users/username/images, not ./images)
Defaults to ./output in the current working directory if not specified
Directory will be created automatically if it doesn't exist

Quality Presets

Choose the right balance of speed, quality, and cost:

Preset	Model	Best for	Speed
`fast` (default)	Nano Banana 2 (Gemini 3.1 Flash Image)	Quick iterations, drafts, high-volume generation	~30–40s
`balanced`	Nano Banana 2 + Thinking	Production images, good quality with reasonable speed	Medium
`quality`	Nano Banana Pro (Gemini 3 Pro Image)	Final deliverables, maximum fidelity, critical visuals	Slow

Set the default via IMAGE_QUALITY environment variable:

IMAGE_QUALITY=fast       # (default) Fastest generation
IMAGE_QUALITY=balanced   # Enhanced thinking for better quality
IMAGE_QUALITY=quality    # Maximum quality output

To override per-request, just tell your AI assistant (e.g., "generate in high quality" or "use balanced quality"). The assistant will pass the appropriate quality parameter automatically.

Codex:

[mcp_servers.mcp-hydrocoder-image.env]
GEMINI_API_KEY = "your_gemini_api_key_here"
IMAGE_QUALITY = "balanced"

Cursor: Add "IMAGE_QUALITY": "balanced" to the env section in your config.

Claude Code:

claude mcp add mcp-hydrocoder-image --env GEMINI_API_KEY=your-api-key --env IMAGE_QUALITY=balanced --env IMAGE_OUTPUT_DIR=/absolute/path/to/images -- npx -y mcp-hydrocoder-image

Skip Prompt Enhancement

Control prompt enhancement via the skipPromptEnhancement tool parameter or the SKIP_PROMPT_ENHANCEMENT environment variable. Parameter takes priority over environment variable.

Parameter	Env Variable	Result
Not set	Not set	Enhancement enabled (default)
Not set	`true`	Enhancement skipped
`true`	Any	Enhancement skipped
`false`	`true`	Enhancement enabled (parameter overrides)

Skipping enhancement is recommended for multi-image blending, where the prompt enhancer may rewrite your blending intent into unrelated content.

Usage Examples

Once configured, just describe what you want in natural language:

Basic Image Generation

"Generate a serene mountain landscape at sunset with a lake reflection"

Your prompt is automatically enhanced with rich details about lighting, materials, composition, and atmosphere.

Image Editing

"Edit this image to make the person face right"
(with inputImagePath: "/path/to/image.jpg")

Advanced Features

Character Consistency:

"Generate a portrait of a medieval knight, maintaining character consistency for future variations"
(with maintainCharacterConsistency: true)

High-Resolution 4K with Text Rendering:

"Generate a professional product photo of a smartphone with clear text on the screen"
(with imageSize: "4K")

Custom Aspect Ratio:

"Generate a cinematic landscape of a desert at golden hour"
(with aspectRatio: "21:9")

API Reference

`generate_image` Tool

The server uses a two-stage process with separate models for each stage:

Prompt Optimization (Gemini 2.5 Flash): Refines your prompt using the Subject–Context–Style framework. Skippable via SKIP_PROMPT_ENHANCEMENT.
Image Generation (Nano Banana 2 or Pro): Creates the final image. Model varies by quality preset.

Parameters

Parameter	Type	Required	Description
`prompt`	string	Yes	Text description or editing instruction
`quality`	string	-	Quality preset: `fast` (default), `balanced`, `quality`. Overrides `IMAGE_QUALITY` env var for this request
`inputImagePath`	string	-	Absolute path to input image for image-to-image editing
`inputImage`	string	-	Base64 encoded image data for image-to-image editing. Alternative to `inputImagePath`
`inputImageMimeType`	string	-	MIME type of the input image (`image/jpeg`, `image/png`, `image/webp`, `image/gif`, `image/bmp`). Used with `inputImage`
`inputImages`	array	-	Multiple input images for multi-image composition. Each item: `{ data: string, mimeType: string }`. Cannot be used with `inputImage`/`inputImagePath`/`inputImagePaths`
`inputImagePaths`	array	-	Multiple input image file paths for multi-image composition. Each item is an absolute path string. Cannot be used with other image input params
`returnBase64`	boolean	-	Return the generated image as base64 data in the response. Image is always saved to disk regardless
`fileName`	string	-	Custom filename for output (auto-generated if not specified). Extension is auto-appended based on output format if omitted
`skipPromptEnhancement`	boolean	-	Skip prompt enhancement and use the prompt as-is. Recommended for multi-image blending. Overrides `SKIP_PROMPT_ENHANCEMENT` env var. Default: `false`
`aspectRatio`	string	-	`1:1` (default), `2:3`, `3:2`, `3:4`, `4:3`, `4:5`, `5:4`, `9:16`, `16:9`, `21:9`, `1:4`, `1:8`, `4:1`, `8:1`
`imageSize`	string	-	`1K`, `2K`, `4K`. Leave unspecified for standard quality
`blendImages`	boolean	-	Enable multi-image blending for combining multiple visual elements naturally
`maintainCharacterConsistency`	boolean	-	Maintain character appearance consistency across different poses and scenes
`useWorldKnowledge`	boolean	-	Use real-world knowledge for accurate context (historical figures, landmarks, factual scenarios)
`useGoogleSearch`	boolean	-	Enable Google Search grounding for real-time factual accuracy
`purpose`	string	-	Intended use (e.g., "cookbook cover", "social media post"). Helps tailor visual style and details

Response

{
  "type": "resource",
  "resource": {
    "uri": "file:///path/to/generated/image.png",
    "name": "image-filename.png",
    "mimeType": "image/png"
  },
  "metadata": {
    "model": "gemini-3.1-flash-image-preview",
    "processingTime": 5000,
    "timestamp": "2026-01-01T12:00:00.000Z"
  }
}

Troubleshooting

Common Issues

"API key not found"

Ensure GEMINI_API_KEY is set in your environment
Verify the API key is valid and has image generation permissions

"Input image file not found"

Use absolute file paths, not relative paths
Ensure the file exists and is accessible
Supported formats: PNG, JPEG, WebP (max 10MB)

"No image data found in Gemini API response"

Try rephrasing your prompt with more specific details
Ensure your prompt is appropriate for image generation
Check if your API key has sufficient quota

Performance Tips

fast preset: ~30–40 seconds typical (includes prompt optimization)
balanced preset: Slightly longer due to enhanced thinking
quality preset: Slower but highest fidelity output
High-resolution (2K/4K): Additional processing time for superior detail
Simple prompts work great — the optimizer automatically adds professional details
Complex prompts are preserved and further enhanced
Consider useWorldKnowledge for historical or factual subjects
Use imageSize: "4K" when text clarity and fine details are critical

Usage Notes

This MCP server uses the paid Gemini API:
- Prompt optimization: Gemini 2.5 Flash (minimal token usage)
- Image generation: Model depends on quality preset
  - fast / balanced: Nano Banana 2 — Gemini 3.1 Flash Image (lower cost)
  - quality: Nano Banana Pro — Gemini 3 Pro Image (higher cost)
- balanced uses additional thinking tokens (slightly higher cost than fast)
Check current pricing and rate limits at Google AI Studio
Monitor your API usage to avoid unexpected charges
The prompt optimization step adds minimal cost while significantly improving output quality

Local Development

If you want to test a local build (e.g., after cloning the repo or making changes), follow these steps instead of using npx.

1. Build the Project

cd /path/to/mcp-hydrocoder-image
npm install
npm run build

2. Configure the MCP Server

Add the following to your MCP configuration file, pointing directly to the local dist/index.js:

Claude Code: ~/.claude.json
Cursor: ~/.cursor/mcp.json or .cursor/mcp.json
Codex: ~/.codex/config.toml

Claude Code / Cursor (JSON):

{
  "mcp-hydrocoder-image": {
    "command": "node",
    "args": ["/absolute/path/to/mcp-hydrocoder-image/dist/index.js"],
    "env": {
      "GEMINI_API_KEY": "your_gemini_api_key_here",
      "IMAGE_OUTPUT_DIR": "/absolute/path/to/output"
    }
  }
}

Proxy users: If you need a proxy, add HTTPS_PROXY and HTTP_PROXY to the env section.

3. Allow Tool Permissions (Claude Code)

To skip the permission prompt on every call, add the tool to your allow list in ~/.claude/settings.json:

{
  "permissions": {
    "allow": [
      "mcp__mcp-hydrocoder-image__generate_image"
    ]
  }
}

4. Restart and Test

Restart your AI tool to load the new MCP server, then try:

"Generate a serene mountain landscape at sunset"

If the image is saved to your IMAGE_OUTPUT_DIR, the local setup is working correctly.

License

MIT License - see LICENSE for details.

Need help? Open an issue or check the troubleshooting section above.

@ynzys/mcp-banana

Quick Install

MCP HydroCoder Image 🍌

How It Works

Features

Agent Skill: Image Generation Prompt Guide

Install

When to Use the Skill vs the MCP Server

Prerequisites

Quick Start

1. Get Your Gemini API Key

2. MCP Configuration

For Codex

For Cursor

For Claude Code

Quality Presets

Skip Prompt Enhancement

Usage Examples

Basic Image Generation

Image Editing

Advanced Features

API Reference

`generate_image` Tool

Parameters

Response

Troubleshooting

Common Issues

Performance Tips

Usage Notes

Local Development

1. Build the Project

2. Configure the MCP Server

3. Allow Tool Permissions (Claude Code)

4. Restart and Test

License

Reviews

@ynzys/mcp-banana

Quick Install

MCP HydroCoder Image 🍌

How It Works

Features

Agent Skill: Image Generation Prompt Guide

Install

When to Use the Skill vs the MCP Server

Prerequisites

Quick Start

1. Get Your Gemini API Key

2. MCP Configuration

For Codex

For Cursor

For Claude Code

Quality Presets

Skip Prompt Enhancement

Usage Examples

Basic Image Generation

Image Editing

Advanced Features

API Reference

generate_image Tool

Parameters

Response

Troubleshooting

Common Issues

Performance Tips

Usage Notes

Local Development

1. Build the Project

2. Configure the MCP Server

3. Allow Tool Permissions (Claude Code)

4. Restart and Test

License

Reviews

`generate_image` Tool