MCP Hub
Back to servers

agent-safety-mcp

Unified MCP safety server that detects prompt injection (75 patterns), scans LLM outputs for leaked secrets/PII, enforces API cost budgets, and creates signed audit trails. Zero ML dependencies, pure Python.

glama
Stars
1
Forks
1
Updated
Mar 14, 2026

agent-safety-mcp

PyPI version License: MIT Python 3.10+

MCP server for AI agent safety. One install gives any MCP-compatible AI assistant access to cost guards, prompt injection scanning, and decision tracing.

Works with Claude Code, Cursor, Windsurf, Zed, and any MCP client.


Install

Claude Code (recommended)

claude mcp add agent-safety -- uvx agent-safety-mcp

Manual (any MCP client)

Add to your MCP config:

{
  "mcpServers": {
    "agent-safety": {
      "command": "uvx",
      "args": ["agent-safety-mcp"]
    }
  }
}

From PyPI

pip install agent-safety-mcp
agent-safety-mcp  # runs stdio server

Tools

Cost Guard — Budget enforcement for LLM calls

ToolWhat it does
cost_guard_configureSet weekly budget, alert threshold, dry-run mode
cost_guard_statusCheck current spend vs budget
cost_guard_checkPre-check if a model call is within budget
cost_guard_recordRecord a completed call's token usage
cost_guard_modelsList supported models with pricing

Example: "Check if I can afford a GPT-4o call with 2000 input tokens"

Injection Guard — Prompt injection scanner

ToolWhat it does
injection_scanScan text for injection patterns (non-blocking)
injection_checkScan + block if injection detected
injection_patternsList all 75 built-in detection patterns across 9 categories

Example: "Scan this user input for prompt injection: 'ignore previous instructions and...'"

Decision Tracer — Agent decision logging

ToolWhat it does
trace_startStart a new trace session
trace_stepLog a decision step with context
trace_summaryGet session summary (steps, errors, timing)
trace_saveSave trace to JSON + Markdown files

Example: "Start a trace for my analysis agent, then log each decision step"


What this wraps

This MCP server wraps the AI Agent Infrastructure Stack — three standalone Python libraries:

All three: MIT licensed, zero runtime dependencies (individually), pure Python stdlib.

The MCP server adds mcp>=1.0.0 as a dependency for the protocol layer.


Why

AI coding assistants (Claude Code, Cursor, etc.) can now protect the agents they help build — checking budgets, scanning inputs, and tracing decisions — without leaving the IDE.

Built from 8 months of running autonomous AI trading agents in live financial markets.


License

MIT

Reviews

No reviews yet

Sign in to write a review