Context Foundry

What if you could describe what you want to build, walk away, and come back to working code?

Context Foundry is an autonomous build system that does the tedious parts for you. Give it a task, and it researches, designs, builds, tests, and deploys—without you babysitting every step.

Quick Start | Documentation | Desktop App

Context Foundry Desktop - Monitor builds, view artifacts, track progress

AI Sidekick - Chat with your builds, ask questions, get insights

What is Context Foundry?

Most AI coding assistants need you to hold their hand—approving every file change, fixing their mistakes, keeping them on track. Context Foundry works differently.

You describe what you want. It figures out the rest.

You: "Build an interactive math game for kids learning multiplication"

[Go grab coffee]

Result: Working React app with 4 difficulty levels, progress tracking,
        animated feedback, and tests—all pushed to GitHub.

Behind the scenes, it spawns specialized AI agents for each phase: one to research, one to design the architecture, one to write code, one to test. Each agent has fresh context and a focused job.

How It Works

Each build runs through phases, and each phase gets a fresh AI agent with its own context window:

Scout — Researches the problem, existing patterns, and constraints
Architect — Designs the solution structure, makes technical decisions, and writes Gherkin acceptance criteria
Builder — Writes the actual code against the acceptance criteria
Test — Validates each Gherkin scenario, catches failures, triggers fixes if needed

If tests fail, it loops back and fixes itself. No manual intervention required.

The key insight: instead of one AI that runs out of context, you get specialized agents that each do one thing well and pass artifacts to the next.

Key Features

Feature	What it does
Self-Healing Builds	Tests fail? It automatically fixes and retries.
Pattern Learning	Remembers solutions that worked, avoids mistakes it's made before. Uses semantic deduplication to prevent duplicate patterns.
Spec Mode	Have a design doc? Skip the AI brainstorming—build directly from your specification files (PDF, Word, Markdown, images).
Human-in-the-Loop	Review Gherkin acceptance criteria before Builder starts. Clear sign-off gates.
Desktop App	Visual dashboard to watch builds, browse artifacts, chat with the AI.
Daemon Service	Runs in the background, manages job queues, handles resource limits.

Extensions

Want to build Roblox games? Flowise workflows? Something domain-specific?

Extensions let you teach Context Foundry your domain. Add patterns, examples, and constraints to extensions/<your-domain>/ and it'll reference them during builds.

Spec Mode: Build from Your Documents

Already have a design document, requirements spec, or wireframes? Spec Mode lets you skip the AI brainstorming and build directly from your specifications.

How It Works

Normal Mode:  Scout → Architect → Builder → Tester
              (AI researches) (AI designs)

Spec Mode:           Architect → Builder → Tester
                     (extracts from YOUR spec)

In Spec Mode:

Scout is skipped — Your spec replaces Scout's requirements analysis
Architect extracts — Reads your spec and fills the standard template (doesn't invent)
Builder implements — Builds exactly what your spec describes
Tester validates — Tests against Gherkin criteria extracted from your spec

Supported File Formats

Format	Extensions	Notes
Plain text	`.txt`, `.md`, `.json`, `.yaml`, `.xml`	Built-in
PDF	`.pdf`	Requires `pypdf` (see below)
Word	`.docx`	Requires `python-docx` (see below)
Images	`.png`, `.jpg`, `.gif`, `.webp`	Diagrams, wireframes, mockups

Enable PDF/Word support:

pip install -r requirements-spec.txt
# Or: pip install pypdf python-docx

How to Run a Spec Mode Build

In Claude Code (natural language):

Build a dashboard app using the spec at ~/Documents/dashboard-spec.pdf
Output to ~/builds/my-dashboard

Multiple spec files:

Build using these specs:
- ~/Documents/requirements.md
- ~/Documents/wireframes.png
- ~/Documents/api-design.pdf

Working directory: ~/builds/my-app

Programmatically (MCP tool):

autonomous_build_and_deploy(
    task="Build a dashboard application",
    working_directory="/Users/me/builds/dashboard",
    spec_files=[
        "/Users/me/Documents/dashboard-spec.pdf",
        "/Users/me/Documents/wireframes.png"
    ]
)

Can Spec Mode Combine with Human-in-the-Loop (HIL)?

Yes! Spec Mode and HIL are independent features that work together:

Mode	What It Controls
Spec Mode	Input source — Where requirements come from (your files vs AI research)
HIL Mode	Approval gates — When to pause for human review

Combined usage:

Build from spec ~/Documents/spec.pdf with human-in-the-loop review

Output to ~/builds/my-app

This will:

Skip Scout (Spec Mode)
Architect extracts from your PDF
Pause for your approval of the architecture (HIL)
Builder implements after you approve
Pause for your approval of the code (HIL)
Tester validates

Pattern Learning: Smarter Over Time

Context Foundry learns from every build. When an agent solves a problem, the solution gets saved to the pattern library. Future builds benefit from past learnings.

How Patterns Work

Build 1: Agent encounters CORS error → Learns solution → Saves pattern
Build 2: Same CORS error → Agent reads pattern → Applies known fix instantly

Patterns are stored in ~/.context-foundry/patterns/:

File	Contains	Used By
`common-issues.json`	Bug fixes, gotchas, workarounds	Builder, Test
`architecture-patterns.json`	Design patterns, best practices	Architect
`test-patterns.json`	Testing strategies, E2E patterns	Test
`scout-learnings.json`	Research insights, domain knowledge	Scout

Semantic Deduplication (New)

Problem: LLMs generate different names for the same issue. Without deduplication, you'd get:

"cors-error-fix"
"fix-cors-issue"
"cross-origin-request-blocked"

All describing the same solution—wasting storage and causing confusion.

Solution: When merging patterns, Context Foundry uses Claude to semantically compare new patterns against existing ones. If they describe the same issue (even with different wording), it updates the existing pattern instead of creating a duplicate.

New pattern: "polling-race-condition-bug"
Existing:    "daemon-job-status-race-condition"

Claude: "These describe the same issue"
Result: Update existing pattern (frequency +1), don't create duplicate

Cost: ~$0.008 per semantic check (Opus 4.5). Only runs when exact ID doesn't match.

Latency: <2 seconds per check. Most builds have 0-2 new patterns, so impact is minimal.

Pattern Library Commands

# View current patterns
cat ~/.context-foundry/patterns/common-issues.json | jq '.patterns | length'

# Sync patterns to cloud (optional)
# Uses S3 for team sharing

Running Builds

Regular Autonomous Build

In Claude Code:

Build a weather dashboard with React

Via CLI:

cfd start                    # Start daemon
cf build "Weather dashboard with React"
cfd logs <job-id> --follow   # Watch progress

What happens:

Scout researches the task
Architect designs the solution
Builder writes the code
Tester validates (loops if tests fail)
Done!

Spec Mode Build

In Claude Code:

Build from spec at ~/Documents/my-spec.pdf
Output to ~/builds/my-project

What happens:

~~Scout~~ (skipped — your spec is the source)
Architect extracts from your spec (doesn't invent)
Builder implements what the spec describes
Tester validates
Done!

Human-in-the-Loop (HIL) Build

In Claude Code:

Build a payment system with human-in-the-loop review

What happens:

Scout researches → Pause for approval
Architect designs → Pause for approval
Builder implements → Pause for approval
Tester validates
Done!

Combined: Spec Mode + HIL

Build from spec ~/Documents/spec.md with HIL review
Output to ~/builds/project

Architecture

sequenceDiagram
    participant U as User
    participant O as Orchestrator
    participant S as Scout Agent
    participant A as Architect Agent
    participant B as Builder Agent
    participant T as Test Agent
    participant FS as .context-foundry/

    U->>O: Task Description

    rect rgb(225, 245, 254)
        Note over O,S: Scout Phase
        O->>S: spawn claude --system-prompt phase_scout.txt
        S->>FS: Write scout_report.json
        S->>O: Return scout-report.md
        O->>O: Parse MD -> SCOUT_JSON
    end

    rect rgb(232, 245, 233)
        Note over O,A: Architect Phase
        O->>A: spawn claude --system-prompt phase_architect.txt<br/>+ SCOUT_JSON injected
        A->>FS: Write architecture.json
        A->>O: Return architecture.md
        O->>O: Parse MD -> ARCHITECTURE_JSON
    end

    rect rgb(255, 243, 224)
        Note over O,B: Builder Phase
        O->>B: spawn claude --system-prompt phase_builder.txt<br/>+ ARCHITECTURE_JSON injected
        B->>FS: Write build-tasks.json
        B->>FS: Write source code files
        B->>O: Return completion status
    end

    rect rgb(252, 228, 236)
        Note over O,T: Test Phase
        O->>T: spawn claude --system-prompt phase_test.txt<br/>+ ARCHITECTURE_JSON.test_plan
        T->>FS: Write test-report-N.md
        T->>O: Return PASSED/FAILED
    end

    alt Tests PASSED
        O->>U: Build Complete
    else Tests FAILED
        O->>A: Re-run with failure context
        Note over A,T: Loop: Architect Fix -> Builder Fix -> Test
    end

Each phase spawns a fresh Claude instance with isolated context, preventing token bloat and ensuring consistent quality across long builds.

Understanding Phases and Agents

A common question: Are Scout, Architect, Builder, etc. "agents" or "phases"?

Answer: Both. They are phases from an orchestration perspective, and ephemeral agent instances from an execution perspective.

How It Works

┌─────────────────────────────────────────────────────────────────┐
│                     DAEMON (Orchestrator)                        │
│  runner.py manages pipeline state, spawns agents sequentially   │
└─────────────────────────────────────────────────────────────────┘
                              │
        ┌─────────────────────┼─────────────────────┐
        ▼                     ▼                     ▼
   ┌─────────┐          ┌─────────┐          ┌─────────┐
   │  Scout  │    →     │Architect│    →     │ Builder │  → ...
   │ Agent   │          │ Agent   │          │ Agent   │
   └─────────┘          └─────────┘          └─────────┘
   200K tokens          200K tokens          200K tokens
   (ephemeral)          (ephemeral)          (ephemeral)
        │                     │                     │
        ▼                     ▼                     ▼
   scout-prompt.json    architect-prompt.json  builder-prompt.json
   scout-report.md      architecture.md        (code files)

Key Characteristics

Aspect	Reality
Context Window	Each phase gets its own fresh 200K tokens
Lifecycle	Ephemeral - spawned, runs, exits, context gone
Communication	Via disk artifacts (not shared memory)
Implementation	Each is a `claude` CLI subprocess
State	Persisted in `.context-foundry/` between phases

Why This Design?

Fresh context every phase — Most builds use only 30-50K tokens per phase, leaving plenty of headroom
Isolation — If Builder crashes, you don't lose Scout's analysis
Resumability — Can restart from any phase since state lives on disk
Focus — Each agent has one job and a tailored prompt for it

They're essentially stateless workers that read artifacts, do work, write artifacts, and disappear.

Desktop App

Features

Feature	Description
Visual Dashboard	See all jobs at a glance with status, duration, and phase progress
Live Duration Counter	Real-time timer showing build progress in seconds
AI Sidekick Chat	Natural language interface to check status, trigger builds, and get help
Phase Timeline	Visual progress through Scout → Architect → Builder → Test phases
Conversation View	See what the AI is thinking during each phase
Artifact Browser	View generated code with syntax highlighting and line numbers
Dark Theme	Native dark mode with beautiful purple accents

Install Desktop App

# Download from releases (macOS)
curl -L https://github.com/your-org/context-foundry/releases/latest/download/ContextFoundry.dmg -o ContextFoundry.dmg
open ContextFoundry.dmg

# Or build from source
cd apps/context-foundry-desktop
npm install
npm run tauri:build

See Desktop App Documentation for full details.

Quick Start

Option 1: Desktop App (Recommended)

Download and install the Desktop App (see above)
Launch Context Foundry Desktop
Use the Sidekick chat to start your first build:
```
"Build a todo app with React"
```

Option 2: CLI

# Install via npm
npm install -g context-foundry

# Start the daemon
cfd start

# Build something
cf build "Create a weather dashboard with React"

This installs the package and automatically configures Claude Code. (Alternative: pip install context-foundry && cf setup)

Option 3: Claude Code Integration

In Claude Code, just ask in natural language:

"Use CF to build a weather dashboard with React"

Walk away. Come back to deployed code on GitHub.

See Quick Start Guide for detailed setup instructions.

CLI Tools

cfd start             # Start the daemon
cfd status            # Check status
cfd logs <job-id>     # View build logs
cfd list              # List active builds
cfd stop              # Stop the daemon

Documentation

Guide	Description
Quick Start	Get running in 5 minutes
Desktop App	Native macOS/Windows application
Dashboard	Web dashboard development guide
User Guide	Detailed usage instructions
Architecture	How it works under the hood
Features	Complete feature reference
Innovations	Technical breakthroughs explained
Phase Handoff Flow	Inter-phase data contracts
Troubleshooting	Common issues and fixes
FAQ	Frequently asked questions

Extension Development

Guide	Description
Extension Development	Create domain-specific extensions
Teaching New Languages	Teach Claude languages not in its training data

File Structure

context-foundry/
├── apps/
│   └── context-foundry-desktop/  # Native macOS/Windows desktop app (Tauri)
├── tools/
│   ├── dashboard/             # React dashboard frontend
│   ├── mcp_server.py          # MCP server entry point
│   ├── mcp_utils/             # Build orchestration, delegation, patterns
│   ├── prompts/phases/        # Phase-specific system prompts
│   ├── evolution/             # Daemon, self-improvement, safety
│   ├── cli.py                 # Main CLI (cf command)
│   └── cfd                    # Daemon CLI script
├── context_foundry/
│   └── daemon/                # Python daemon (HTTP API, job management)
├── extensions/                # Domain-specific extensions
├── npm/                       # npm package wrapper
├── docs/                      # Documentation
└── CLAUDE.md                  # Instructions for AI agents

Contributing

We welcome contributions. See Contributing Guide for details.

License

MIT License. See LICENSE for details.

context-foundry

Quick Install

Context Foundry

What is Context Foundry?

How It Works

Key Features

Extensions

Spec Mode: Build from Your Documents

How It Works

Supported File Formats

How to Run a Spec Mode Build

Can Spec Mode Combine with Human-in-the-Loop (HIL)?

Pattern Learning: Smarter Over Time

How Patterns Work

Semantic Deduplication (New)

Pattern Library Commands

Running Builds

Regular Autonomous Build

Spec Mode Build

Human-in-the-Loop (HIL) Build

Combined: Spec Mode + HIL

Architecture

Understanding Phases and Agents

How It Works

Key Characteristics

Why This Design?

Desktop App

Features

Install Desktop App

Quick Start

Option 1: Desktop App (Recommended)

Option 2: CLI

Option 3: Claude Code Integration

CLI Tools

Documentation

Extension Development

File Structure

Contributing

License

Reviews