promptspeak-mcp-server

Pre-execution governance for AI agents. Blocks dangerous tool calls before they execute.

AI agents call tools (file writes, API requests, shell commands) with no validation layer between intent and execution. A prompt injection, hallucinated argument, or drifting goal can trigger irreversible actions. PromptSpeak intercepts every MCP tool call, validates it against deterministic rules, and blocks or holds risky operations for human approval — in 0.1ms, before anything executes.

PromptSpeak Governance Demo

When to use this

You run AI agents that call tools (MCP servers, function calling, tool use) and need a governance layer between the agent and the tools.
You need human-in-the-loop approval for high-risk operations (production deployments, financial transactions, legal filings).
You want to detect behavioral drift — an agent gradually shifting away from its assigned task.
You need an audit trail of every tool call an agent attempted, whether it was allowed or blocked.
You operate in a regulated domain (legal, financial, healthcare) where agent actions must be deterministically constrained.

Quick start

npm install promptspeak-mcp-server

Claude Desktop / Claude Code

Add to your MCP configuration (claude_desktop_config.json or .claude/settings.json):

{
  "mcpServers": {
    "promptspeak": {
      "command": "npx",
      "args": ["promptspeak-mcp-server"]
    }
  }
}

From source

git clone https://github.com/chrbailey/promptspeak-mcp-server.git
cd promptspeak-mcp-server
npm install && npm run build
npm start

How it works: 8-stage validation pipeline

Every tool call passes through this pipeline. If any stage fails, execution is blocked.

Agent calls tool
  │
  ├─ 1. Circuit Breaker ──── Halted agents blocked instantly (no further checks)
  ├─ 2. Frame Validation ─── Structural, semantic, and chain rule checks
  ├─ 3. Drift Prediction ─── Pre-flight behavioral anomaly detection
  ├─ 4. Hold Check ────────── Risky operations held for human approval
  ├─ 5. Interceptor ───────── Final permission gate (confidence thresholds)
  ├─ 6. Tool Execution ────── Only reached if all 5 pre-checks pass
  ├─ 7. Post-Audit ────────── Confirms behavior matched prediction
  └─ 8. Immediate Action ──── Halts agent if critical drift detected post-execution

Stages 1-5 are pre-execution — the tool never runs if any check fails. Stages 7-8 are post-execution — they detect drift and can halt the agent for future calls.

MCP tools (40)

Core governance

Tool	When to call it	What it does
`ps_validate`	Before executing any agent action	Validate a frame against all rules without executing
`ps_validate_batch`	When checking multiple actions at once	Batch validation for efficiency
`ps_execute`	When an agent wants to perform a tool call	Full pipeline: validate → hold check → execute → audit
`ps_execute_dry_run`	When previewing what would happen	Run full pipeline without executing the tool

Human-in-the-loop holds

Tool	When to call it	What it does
`ps_hold_list`	When reviewing pending agent actions	List all operations awaiting human approval
`ps_hold_approve`	When a held operation should proceed	Approve with optional modified arguments
`ps_hold_reject`	When a held operation should be denied	Reject with reason
`ps_hold_config`	When tuning which operations require approval	Configure hold triggers and thresholds
`ps_hold_stats`	When monitoring hold queue health	Hold queue statistics

Agent lifecycle

Tool	When to call it	What it does
`ps_state_get`	When checking what an agent is doing	Get agent's active frame and last action
`ps_state_system`	When monitoring overall system health	System-wide statistics
`ps_state_halt`	When an agent must be stopped immediately	Trip circuit breaker — blocks all future calls
`ps_state_resume`	When a halted agent should be allowed to continue	Reset circuit breaker
`ps_state_reset`	When clearing agent state	Full state reset
`ps_state_drift_history`	When investigating behavioral changes	Drift detection alert history

Delegation

Tool	When to call it	What it does
`ps_delegate`	When an agent spawns a sub-agent	Create parent→child delegation with constrained permissions
`ps_delegate_revoke`	When revoking a sub-agent's authority	Remove delegation
`ps_delegate_list`	When auditing delegation chains	List active delegations

Configuration

Tool	When to call it	What it does
`ps_config_set`	When changing governance rules at runtime	Set configuration key-value pairs
`ps_config_get`	When reading current configuration	Get current config
`ps_config_activate`	When switching policy profiles	Activate a named configuration
`ps_config_export`	When backing up configuration	Export full config as JSON
`ps_config_import`	When restoring configuration	Import config from JSON
`ps_confidence_set`	When tuning validation strictness	Set confidence thresholds
`ps_confidence_get`	When checking current thresholds	Get current thresholds
`ps_confidence_bulk_set`	When reconfiguring multiple thresholds	Batch threshold update
`ps_feature_set`	When toggling pipeline stages	Enable/disable specific checks
`ps_feature_get`	When checking which stages are active	Get feature flags

Symbol registry (entity tracking)

Tool	When to call it	What it does
`ps_symbol_create`	When registering a new entity (company, person, system)	Create symbol with type, metadata, and tags
`ps_symbol_get`	When looking up an entity	Retrieve by ID
`ps_symbol_update`	When entity data changes	Update metadata or tags
`ps_symbol_list`	When browsing entities by type	List with optional type filter
`ps_symbol_delete`	When removing an entity	Delete by ID
`ps_symbol_import`	When bulk-loading entities	Batch import
`ps_symbol_stats`	When monitoring registry health	Registry statistics
`ps_symbol_format`	When displaying an entity	Format symbol for display
`ps_symbol_verify`	When confirming entity data is current	Mark symbol as verified
`ps_symbol_list_unverified`	When auditing stale data	List symbols needing verification
`ps_symbol_add_alternative`	When an entity has aliases	Add alternative identifier

Audit

Tool	When to call it	What it does
`ps_audit_get`	When reviewing what happened	Full audit trail with filters

Architecture

src/
├── gatekeeper/       # 8-stage validation pipeline (core enforcement)
│   ├── index.ts      #   Pipeline orchestrator + agent eviction policy
│   ├── validator.ts  #   Frame structural/semantic/chain validation
│   ├── interceptor.ts#   Permission gate with confidence thresholds
│   ├── hold-manager.ts#  Human-in-the-loop hold queue
│   ├── resolver.ts   #   Frame resolution with operator overrides
│   └── coverage.ts   #   Coverage confidence calculator
├── drift/            # Behavioral drift detection
│   ├── circuit-breaker.ts  # Per-agent halt/resume
│   ├── baseline.ts         # Behavioral baseline comparison
│   ├── tripwire.ts         # Anomaly tripwires
│   └── monitor.ts          # Continuous monitoring
├── symbols/          # SQLite-backed entity registry (11 CRUD tools)
├── policies/         # Policy file loader + overlay system
├── operator/         # Operator configuration
├── tools/            # MCP tool implementations
│   ├── registry.ts   #   29 core tools
│   └── ps_hold.ts    #   5 hold tools
├── handlers/         # Tool dispatch + metadata registry
├── core/             # Logging, errors, result patterns
└── server.ts         # MCP server entry point (stdio transport)

Performance

Metric	Value
Validation latency	0.103ms avg (P95: 0.121ms)
Operations/second	6,977
Holds/second	33,333
Test suite	563 tests, 16 test files

Requirements

Node.js >= 20.0.0
TypeScript 5.3+ (build from source)
No external services required — SQLite for symbols, in-memory for everything else

License

MIT

promptspeak-mcp-server

Quick Install

promptspeak-mcp-server

When to use this

Quick start

Claude Desktop / Claude Code

From source

How it works: 8-stage validation pipeline

MCP tools (40)

Core governance

Human-in-the-loop holds

Agent lifecycle

Delegation

Configuration

Symbol registry (entity tracking)

Audit

Architecture

Performance

Requirements

License

Reviews