Agent Orchestration & Security Template

An AI safety engineering portfolio combining production tooling, detection research, and risk analysis—structured as a reusable template.

MCP Demo

Important: This is an advanced template repository designed for experienced developers working with autonomous AI agents. Before diving in, we strongly recommend:

Read the AI Safety Training Guide - Essential concepts for safe human-AI collaboration, including deception detection, scalable oversight, and control protocols

Take an AI Safety course at BlueDot Impact - Free, rigorous training programs covering AI safety fundamentals, governance, and alignment. Highly recommended for anyone building with autonomous agents.

Working with AI agents introduces risks that differ fundamentally from traditional software. Understanding these risks isn't optional - it's a prerequisite for responsible development.

Project Philosophy

This project follows a container-first approach:

All tools and CI/CD operations run in Docker containers for maximum portability
Zero external dependencies - runs on any Linux system with Docker
Self-hosted infrastructure - no cloud costs, full control over runners
Single maintainer design - optimized for individual developer productivity
Modular MCP architecture - Separate specialized servers for different functionalities

Quick Start

New to the template? Check out our Template Quickstart Guide for step-by-step customization instructions!

Prerequisites: Linux system with Docker (v20.10+) and Docker Compose (v2.0+)

Clone and setup

git clone https://github.com/AndrewAltimit/template-repo
cd template-repo
pip3 install -e ./packages/github_agents

Set API keys (if using AI features)

export OPENROUTER_API_KEY="your-key-here"  # For OpenCode/Crush
export GEMINI_API_KEY="your-key-here"      # For Gemini

Use with Claude Code: MCP servers are configured in .mcp.json and auto-started by Claude. See MCP Configuration for essential vs full setups.

Run CI/CD operations

./automation/ci-cd/run-ci.sh full  # Full pipeline

For detailed setup, see CLAUDE.md and Template Quickstart Guide.

AI Agents

Six AI agents for development and automation. See AI Agents Documentation for details.

Agent	Provider	Use Case	Documentation
Claude Code	Anthropic	Primary development assistant	Setup Guide
Codex	OpenAI	Code generation	Setup Guide
OpenCode	OpenRouter	Code generation	AI Code Agents
Crush	OpenRouter	Code generation	AI Code Agents
Gemini	Google	Code review (limited tool use)	Setup Guide
GitHub Copilot	GitHub	PR review suggestions	-

All code generation agents (Codex, OpenCode, Crush) provide equivalent functionality - choose based on your API access.

Security: Keyword triggers, user allow list, secure token management. See Security Model

Safety Training: Essential AI safety concepts for human-AI collaboration. See Human Training Guide

Sleeper Agents: Create and evaluate sleeper agents in order to detect misalignment and probe for deception. See Sleeper Agents Package

Agentic Git Workflow

AI agents autonomously manage the development lifecycle from issue creation through PR merge:

Issue Created → Admin Approval → Agent Claims → PR Created → AI Review → Human Merge

The Flow:

Issue Creation - Issues are created manually or by agents via backlog-refinement.yml, automatically added to the GitHub Projects board
Admin Approval - An authorized user comments [Approved][Claude] (or another agent name) to authorize work
Agent Claims - board-agent-worker.yml finds approved issues, the agent claims the issue and creates a working branch
Implementation - The agent implements the fix/feature and opens a PR
AI Review - pr-validation.yml triggers Gemini code review; pr-review-monitor.yml lets agents iterate on feedback
Human Merge - Admin reviews and merges the PR

Security Model:

Approval Required - Agents cannot work on issues without explicit [Approved][Agent] comment
Authorized Users Only - Only users listed in .agents.yaml → security.agent_admins can approve
Pattern Validation - Must use [Action][Agent] format (e.g., [Approved][Claude]) to prevent false positives
Claim Tracking - Agents post claim comments with timestamps to prevent conflicts

See Security Documentation for the complete security model.

Reports & Research

Technical reports and guides exploring AI risks, safety frameworks, and philosophical questions. PDFs are automatically built from LaTeX source and published with each release.

Emerging Technology Risk Assessments

Scenario-based projection reports analyzing potential futures involving advanced AI systems. See Projections Documentation.

Report	Topic	PDF	Source
AI Agents Political Targeting	Political violence risk	Download	LaTeX
AI Agents WMD Proliferation	WMD proliferation risk	Download	LaTeX
AI Agents Espionage Operations	Intelligence tradecraft	Download	LaTeX
AI Agents Economic Actors	Autonomous economic actors	Download	LaTeX
AI Agents Financial Integrity	Money laundering & corruption	Download	LaTeX
AI Agents Institutional Erosion	IC monopoly erosion & verification pivot	Download	LaTeX

Technical Guides

Guide	Description	PDF	Source
Agentic Workflow Handout	AI agent pipeline architecture and workflows	Download	LaTeX
Sleeper Agents Framework	AI backdoor detection using residual stream analysis	Download	LaTeX
AgentCore Memory Integration	Multi-provider AI memory system	Download	LaTeX
Virtual Character System	AI agent embodiment platform	Download	LaTeX

Philosophy Papers

Philosophical explorations of minds, experience, and intelligence. See Philosophy Papers Documentation.

Paper	Topic	PDF	Source
Architectural Qualia	What Is It Like to Be an LLM?	Download	LaTeX

Build Status:

Packages

Four standalone packages addressing different aspects of AI agent development and safety:

Package	Purpose	Documentation
GitHub Agents	Multi-agent orchestration for autonomous GitHub workflows - issue monitoring, PR review processing, and board coordination with Claude, Codex, OpenCode, Gemini, and Crush	README \| Security
Sleeper Agents	Production-validated detection framework for hidden backdoors in LLMs, based on Anthropic's research on deceptive AI that persists through safety training	README \| PDF Guide
Economic Agents	Simulation framework demonstrating autonomous AI economic capability - agents that earn money, form companies, hire sub-agents, and seek investment. For governance research and policy development	README
Injection Toolkit	Cross-platform Rust framework for runtime integration - DLL injection (Windows), LD_PRELOAD (Linux), shared memory IPC, and overlay rendering. For game modding, debugging tools, and AI agent embodiment	README \| Architecture

# Install Python packages
pip install -e ./packages/github_agents
pip install -e ./packages/sleeper_agents
pip install -e ./packages/economic_agents

# Build Rust package (requires Rust toolchain)
cd packages/injection_toolkit && cargo build --release

Features

18 MCP Servers - Code quality, content creation, AI assistance, 3D graphics, video editing, speech synthesis, and more
6 AI Agents - Autonomous development workflow from issue to merge
4 Packages - GitHub automation, sleeper agent detection, economic agent simulation, runtime injection toolkit
Container-First Architecture - Maximum portability and consistency
Self-Hosted CI/CD - Zero-cost GitHub Actions infrastructure
Company Integration - Corporate proxy builds for enterprise AI APIs (Docs)

Enterprise & Corporate Setup

For enterprise environments requiring custom certificates, customize automation/corporate-proxy/shared/scripts/install-corporate-certs.sh. This script runs during Docker builds for all containers. See the customization guide for details.

Project Structure

.
├── .github/workflows/        # GitHub Actions workflows
├── docker/                   # Docker configurations
├── packages/                 # Installable packages
│   ├── github_agents/        # Multi-agent GitHub automation
│   ├── sleeper_agents/       # AI backdoor detection framework
│   ├── economic_agents/      # Autonomous economic agents
│   └── injection_toolkit/    # Rust runtime injection framework
├── tools/
│   ├── mcp/                  # 18 MCP servers (see MCP Servers section)
│   └── cli/                  # Agent runners and utilities
├── automation/               # CI/CD and automation scripts
├── tests/                    # Test files
├── docs/                     # Documentation
└── config/                   # Configuration files

MCP Servers

Available Servers

Code Quality - Formatting, linting, auto-formatting
Content Creation - Manim animations, LaTeX, TikZ diagrams
Gaea2 - Terrain generation (Documentation)
Blender - 3D content creation, rendering, physics simulation (Documentation)
Gemini - AI consultation (containerized and host modes available)
Codex - AI-powered code generation and completion
OpenCode - Code generation via OpenRouter
Crush - Code generation via OpenRouter
Meme Generator - Create memes with templates
ElevenLabs Speech - Advanced TTS with v3 model, 50+ audio tags, 74 languages (Documentation)
Video Editor - AI-powered video editing with transcription and scene detection (Documentation)
Virtual Character - AI agent embodiment in virtual worlds (VRChat, Blender, Unity) (Documentation)
GitHub Board - GitHub Projects v2 board management, work claiming, agent coordination (Documentation)
AI Toolkit - LoRA training interface (remote: 192.168.0.222:8012)
ComfyUI - Image generation interface (remote: 192.168.0.222:8013)
AgentCore Memory - Multi-provider AI memory (AWS AgentCore or ChromaDB) (Documentation)
Reaction Search - Semantic search for anime reaction images (Documentation)
Desktop Control - Cross-platform desktop automation for Linux and Windows (Documentation)

Usage Modes

STDIO Mode (local MCPs): Configured in .mcp.json, auto-started by Claude
HTTP Mode (remote MCPs): Run the MCP using docker-compose on the remote node.

See MCP Architecture Documentation and STDIO vs HTTP Modes for details.

Tool Reference

For complete tool listings, see MCP Tools Reference

Configuration

Environment Variables

See .env.example for all available options.

Key Configuration Files

.mcp.json - MCP server configuration for Claude Code
docker-compose.yml - Container services configuration
CLAUDE.md - Project-specific Claude Code instructions (root directory)
AGENTS.md - Universal AI agent configuration and guidelines (root directory)
docs/agents/project-context.md - Context for AI reviewers

Setup Guides

Development Workflow

Container-First Development

All Python operations run in Docker containers:

# Run CI operations
./automation/ci-cd/run-ci.sh format      # Check formatting
./automation/ci-cd/run-ci.sh lint-basic  # Basic linting
./automation/ci-cd/run-ci.sh test        # Run tests
./automation/ci-cd/run-ci.sh full        # Full CI pipeline

# Run specific tests
docker-compose run --rm python-ci pytest tests/test_mcp_tools.py -v

GitHub Actions

Pull Request Validation - Automatic Gemini AI review
Continuous Integration - Full CI pipeline
Code Quality - Multi-stage linting (containerized)
Automated Testing - Unit and integration tests
Security Scanning - Bandit and safety checks

All workflows run on self-hosted runners for zero-cost operation.

Documentation

Core Documentation

AGENTS.md - Universal AI agent configuration and guidelines
CLAUDE.md - Claude-specific instructions and commands
MCP Architecture - Modular server design
AI Agents Documentation - AI agents overview

Quick References

Integration Guides

Setup & Configuration

Template Quickstart Guide - Customize the template for your needs
Self-Hosted Runner Setup
GitHub Environments Setup
Containerized CI

License

This project is released under the Unlicense (public domain dedication).

For jurisdictions that do not recognize public domain: As a fallback, this project is also available under the MIT License.

template-repo