MCP Hub
Back to servers

@iflow-mcp/zengram-mcp

Requires Setup

Persistent multi-path memory for AI agents — vector + BM25 keyword + entity graph search with RRF fusion, credential scrubbing, auto-consolidation, multi-backend storage (Qdrant + SQLite/Postgres)

npm46/wk
Stars
20
Forks
4
Updated
Apr 1, 2026
Validated
Apr 10, 2026

Quick Install

npx -y @iflow-mcp/zengram-mcp

ZenSystem

Zengram

Shared memory for multi-agent AI systems.

Quick StartHow It WorksBenchmarksAdaptersAPI DocsConfig

CI npm License: MIT Node 20+ Docker MCP GitHub stars

Zengram — shared memory for multi-agent AI systems

Store a fact from Claude Code on your laptop, recall it from an autonomous agent on your server, get a briefing from n8n — all through the same memory system. Born from a production setup where nothing existed that let multiple AI agents share memory across separate machines.


The Problem

Before and after shared memory

You run multiple AI agents — Claude Code for development, autonomous agents for tasks, n8n for automation. They each maintain their own context and forget everything between sessions. When one agent discovers something important, the others never learn about it.

How It Works

Typed Memory

4 Memory Types

Events are immutable history. Facts upsert by key — new facts supersede old ones. Statuses track current state. Decisions record choices and reasoning. Each type has its own lifecycle, decay rules, and mutation semantics.

Dual Storage

Dual Database Design

Every memory lives in two places: Qdrant for semantic vector search and SQLite/Postgres for structured queries, entity graphs, and full-text BM25 search. Get both "find memories similar to X" and "give me all facts with key Y" from the same system.

Multi-Path Search

Search runs three retrieval paths in parallel, fused with Reciprocal Rank Fusion:

  1. Vector search — Cosine similarity via Qdrant
  2. Keyword search — BM25 via Postgres tsvector or SQLite FTS5
  3. Entity graph — BFS traversal through relationship graph

Items found by multiple paths get boosted. 98.4% retrieval accuracy on LongMemEval.

Built for Multi-Agent

  • Cross-agent briefings — "What happened since I was last here?" returns updates from all other agents
  • Agent-scoped API keys — Each agent gets its own identity and permissions
  • Cross-agent corroboration — Two agents storing the same fact = corroboration, not duplication
  • Credential scrubbing — API keys, JWTs, passwords automatically redacted before storage
  • Entity extraction — Regex + alias cache on every write, LLM refinement on consolidation
  • LLM consolidation — Periodic background process merges duplicates, flags contradictions, discovers connections

Benchmarks

LongMemEval Benchmark Results

Evaluated on LongMemEval, the academic benchmark for long-term conversational memory:

Score
Retrieval accuracy (finding the right memories)98.4%
QA accuracy (GPT-4o answering from retrieved context)76.0%
Full-context GPT-4o (entire history in prompt, no retrieval)72.4%

The benchmark uses cosine similarity only — none of the API's multi-path features (BM25, entity graph, temporal boost) were used. Full methodology and per-category breakdown.

LongMemEval tests single-agent chat recall. Zengram is built for multi-agent coordination — features like cross-agent briefings, typed memory, entity graphs, and credential scrubbing aren't measured by this benchmark but are core to production use.

How It Compares

FeatureZengramMem0LettaZepHindsight
Cross-machine by designYesCloud onlyNoCloud onlyNo
Typed memory (event/fact/status/decision)YesNoNoNoNo
Multi-path search (vector+BM25+graph)YesVector onlyVector onlyHybridYes
Cross-agent corroborationYesNoNoNoNo
Session briefingsYesNoNoNoNo
Credential scrubbingYesNoNoNoNo
Entity extraction + linkingYesGraph (Pro)NoYesNo
LLM consolidationYesInlineSelf-managedNoReflect
Temporal validityYesNoNoYesNo
MCP server includedYesCommunityNoNoYes
Self-hostable (fully open)YesCommunity ed.YesGraphiti onlyYes

Quick Start

git clone https://github.com/ZenSystemAI/zengram.git
cd zengram

cp .env.example .env
# Edit .env — set BRAIN_API_KEY and your embedding provider key

docker compose up -d

# Verify
curl http://localhost:8084/health

# Store your first memory
curl -X POST http://localhost:8084/memory \
  -H "Content-Type: application/json" \
  -H "X-Api-Key: YOUR_KEY" \
  -d '{
    "type": "fact",
    "content": "The API uses port 8084 by default",
    "source_agent": "my-agent",
    "key": "api-default-port"
  }'

Adapters & SDKs

MCP Server (Claude Code, Cursor, Windsurf)

14 tools: brain_store, brain_search, brain_briefing, brain_query, brain_stats, brain_consolidate, brain_entities, brain_delete, brain_client, brain_export, brain_import, brain_graph, brain_reflect, brain_update.

{
  "mcpServers": {
    "zengram": {
      "command": "node",
      "args": ["/path/to/zengram/mcp-server/src/index.js"],
      "env": {
        "BRAIN_API_URL": "http://localhost:8084",
        "BRAIN_API_KEY": "your-key"
      }
    }
  }
}

Or install via npm: npm install -g @zensystemai/zengram-mcp

Python SDK

pip install zengram
from zengram import BrainClient

client = BrainClient("http://localhost:8084", api_key="your-key")
client.store(type="fact", content="Production DB is on db-prod-1", source_agent="devops", key="prod-db")
results = client.search("database configuration")

Claude Code Skills

Copy adapters/claude-code/sessionend/ to your project's .claude/skills/ to get the /sessionend ritual — structured session reflections stored directly to Zengram. Full guide.

Bash CLI, n8n, OpenClaw

Documentation

DocDescription
API ReferenceEvery endpoint with request/response examples
ArchitectureSystem design, data flows, component inventory
ConfigurationAll environment variables
Data ModelMemory types, decay, dedup, supersedes logic
MCP ToolsThe 14 MCP tools agents use
OperationsDeployment, monitoring, failure modes
BenchmarksFull LongMemEval methodology and results
Examplescurl demo, Python client, multi-agent scenario

Roadmap

Recently shipped: Web dashboard, Python SDK, SSE subscriptions, multi-collection support, on-demand LLM reflection, temporal validity, multi-path RRF search, entity graph visualization — full changelog

Coming next: Automatic memory capture, TypeScript SDK, hosted docs, LangChain/LlamaIndex integration

Contributing

Contributions welcome! See CONTRIBUTING.md.

See Also

License

MIT — see LICENSE.


Built by ZenSystem AI

Reviews

No reviews yet

Sign in to write a review