MCP Hub
Back to servers

verifiable-thinking-mcp

A sophisticated reasoning MCP server that prevents LLM hallucination through 15 pattern-based trap detectors, automatic self-challenges, and contradiction detection.

Stars
1
Tools
5
Updated
Jan 16, 2026

Quick Install

npx -y verifiable-thinking-mcp
Verifiable Thinking MCP

Your LLM is confidently wrong 40% of the time on reasoning questions. This fixes that.

npm version CI codecov License: MIT

15 trap patterns detected in <1ms. No LLM calls. Just pattern matching.

Quick StartFeaturesTrap DetectionAPI


┌────────────────────────────────────────────────────────────────┐
│ "A bat and ball cost $1.10. The bat costs $1 more..."          │
│                             ↓                                  │
│ TRAP DETECTED: additive_system                                 │
│ > Don't subtract $1 from $1.10. Set up: x + (x+1) = 1.10       │
│                             ↓                                  │
│ Answer: $0.05 (not $0.10)                                      │
└────────────────────────────────────────────────────────────────┘

Quick Start

npx -y verifiable-thinking-mcp

Add to Claude Desktop (claude_desktop_config.json):

{
  "mcpServers": {
    "verifiable-thinking": {
      "command": "npx",
      "args": ["-y", "verifiable-thinking-mcp"]
    }
  }
}

Features

🎯 Trap Detection15 patterns (bat-ball, Monty Hall, base rate) caught before reasoning starts
⚔️ Auto-ChallengeForces counterarguments when confidence >95%—no more overconfident wrong answers
🔍 Contradiction DetectionCatches "Let x=5" then "Now x=10" across steps
🌿 Hypothesis BranchingExplore alternatives, auto-detects when branches confirm/refute
🔢 Local MathEvaluates expressions without LLM round-trips
🗜️ CompressionQuery-aware context compression for long chains

How It Works

// Start with a question—trap detection runs automatically
scratchpad({
  operation: "step",
  question: "A bat and ball cost $1.10...",
  thought: "Let ball = x, bat = x + 1.00",
  confidence: 0.9
})
// → Returns trap_analysis warning

// High confidence? Auto-challenge kicks in
scratchpad({ operation: "step", thought: "...", confidence: 0.96 })
// → Returns challenge_suggestion: "What if your assumption is wrong?"

// Complete with spot-check
scratchpad({ operation: "complete", final_answer: "$0.05" })

Trap Detection

PatternWhat It Catches
additive_systemBat-ball, widget-gadget (subtract instead of solve)
nonlinear_growthLily pad doubling (linear interpolation)
monty_hallDoor switching (50/50 fallacy)
base_rateMedical tests (ignoring prevalence)
independenceCoin flips (gambler's fallacy)
All 15 patterns
PatternTrap
additive_systemSubtract instead of solve
nonlinear_growthLinear interpolation
rate_patternIncorrect scaling
harmonic_meanArithmetic mean for rates
independenceGambler's fallacy
pigeonholeUnderestimate worst case
base_rateIgnore prevalence
factorial_countingSimple division
clock_overlapAssume 12 overlaps
conditional_probabilityIgnore conditioning
conjunction_fallacyMore detail = more likely
monty_hall50/50 after reveal
anchoringIrrelevant number influence
sunk_costPast investment bias
framing_effectGain/loss framing

Tools

scratchpad — the main tool with 11 operations:

OperationWhat It Does
stepAdd reasoning step (trap priming on first)
completeFinalize with auto spot-check
reviseFix earlier step
branchExplore alternative path
challengeForce adversarial self-check
navigateView history/branches
All operations
OperationPurpose
stepAdd reasoning step
completeFinalize chain
reviseFix earlier step
branchAlternative path
challengeAdversarial self-check
navigateView history
spot_checkManual trap check
hintProgressive simplification
mistakesAlgebraic error detection
augmentCompute math expressions
overrideForce-commit failed step

Other tools: list_sessions, get_session, clear_session, compress

vs Sequential Thinking MCP

Sequential ThinkingVerifiable Thinking
Trap detection15 patterns
Auto-challenge>95% confidence
Contradiction detection
Confidence trackingPer-step + chain
Local compute
Token budgetsSoft + hard limits

Sequential Thinking is ~100 lines. This is 22,000+ with 1,831 tests.

See docs/competitive-analysis.md for full breakdown.

Development

git clone https://github.com/CoderDayton/verifiable-thinking-mcp.git
cd verifiable-thinking-mcp && bun install
bun run dev      # Interactive MCP Inspector
bun test         # 1,831 tests

License

MIT


Reviews

No reviews yet

Sign in to write a review