engram

Persistent memory for AI agents. Store, forget, and recall — like a brain.

engram demo — store, context reset, recall

Why?

Most agent memory is just a vector database. Everything stays forever, retrieval is dumb similarity search, and there's no concept of what matters or how things relate.

engram organizes memory along two dimensions:

Time — what survives

A three-layer lifecycle inspired by how human memory actually works:

Layer	Role	Behavior
Buffer	Short-term intake	All new memories land here. Unreinforced entries decay naturally
Working	Active knowledge	Promoted when accessed repeatedly or tagged as lessons/procedures
Core	Long-term identity	Promoted through sustained usage and LLM quality gate. Procedures never decay

You just store memories. The system figures out what's important and promotes it. Unused memories fade. Lessons persist. Duplicates reinforce instead of accumulating.

Space — what connects

A self-organizing topic tree clusters related memories by semantic similarity, with LLM-powered naming:

Memory Architecture
├── Three-layer lifecycle [4]
├── Embedding pipeline [3]
└── Consolidation logic [5]
Deploy & Ops
├── CI/CD procedures [3]
└── Production incidents [2]
User Preferences [6]
Git Workflow [4]

The tree is hierarchical — related topics nest under shared parents. On session start, your agent gets a flattened index of leaf topics (the knowledge directory). See something relevant? Drill into it with POST /topic. The tree rebuilds automatically as memories change.

Time tells you what matters. Space tells you what's related. Together, they give an agent something closer to understanding than a flat list of embeddings ever could.

Single binary, ~10 MB, ~100 MB RSS in production. No Docker, no Python, no external database.

How It Works

sequenceDiagram
    participant A as Agent
    participant E as engram
    participant LLM as LLM

    Note over A,E: Session Start
    A->>E: GET /resume
    E-->>A: core + recent + topics + triggers

    Note over A,E: During Work
    A->>E: POST /memories {decision}
    A->>E: POST /recall {query}
    E-->>A: ranked results
    A->>E: POST /topic {"ids":["kb3"]}
    E-->>A: full topic memories
    A->>E: GET /triggers/deploy
    E-->>A: relevant lessons

    Note over E,LLM: Background (30min cycle)
    E->>E: decay, dedup, promote
    E->>LLM: triage / gate / merge
    E->>E: rebuild topic tree
    E->>LLM: name new topics

    Note over E,LLM: Audit (12h cycle)
    E->>LLM: review memory quality
    LLM-->>E: promote, demote, merge

Requirements

An OpenAI-compatible or Anthropic-native API for embeddings and LLM calls. Set ENGRAM_LLM_URL and ENGRAM_LLM_KEY. For Anthropic, also set ENGRAM_LLM_PROVIDER=anthropic.

Without an LLM, engram still works — keyword search and heuristic lifecycle only. You lose the smart parts (triage, merge, topic naming, audit).

Install

# macOS / Linux
curl -fsSL https://raw.githubusercontent.com/kael-bit/engram-rs/main/install.sh | bash

# Windows (PowerShell)
irm https://raw.githubusercontent.com/kael-bit/engram-rs/main/install.ps1 | iex

The installer walks you through download, configuration, and startup interactively.

For AI agents — paste this into your session:

Set up engram (persistent memory) by following the guide at:
https://raw.githubusercontent.com/kael-bit/engram-rs/main/docs/SETUP.md

Features

Session Recovery

One call to restore context on wake-up or after compaction:

Section	What it gives you
Core	Full text of permanent rules and identity — never truncated
Recent	Recently changed memories for short-term continuity
Topics	Named topic index — browse by subject, drill into any cluster
Triggers	Pre-action safety tags — auto-surface lessons before risky operations

Hybrid Search

Semantic embeddings + BM25 keyword search (with jieba for CJK) + fact-triple lookup → single ranked result set. Dual-hit boosting when multiple signals agree. Embedding cache for repeat queries (<15ms).

Memory Types

Kind	Decay	Use Case
`semantic`	Normal	Facts, preferences, knowledge (default)
`episodic`	Normal	Events, experiences, time-bound context
`procedural`	Never	Workflows, instructions, how-to — persists indefinitely

Fact Triples

Structured (subject, predicate, object) relationships extracted from memories. Contradictions are automatically resolved — new facts supersede old ones with timestamps.

Namespace Isolation

One instance, multiple projects. Each workspace gets its own memory space with shared access to a default namespace for cross-project knowledge.

Background Maintenance

Fully autonomous — no cron needed:

Consolidation (30min): promote, decay, dedup, merge, rebuild topic tree
Audit (12h): LLM reviews memory quality, merges duplicates, demotes stale entries

MCP & API

17 MCP tools for Claude Code, Cursor, Windsurf, OpenClaw, and others — see docs/MCP.md.

HTTP API and configuration: docs/SETUP.md.

License

MIT

engram

Quick Install

engram

Why?

Time — what survives

Space — what connects

How It Works

Requirements

Install

Features

Session Recovery

Hybrid Search

Memory Types

Fact Triples

Namespace Isolation

Background Maintenance

MCP & API

License

Reviews