MCP Hub
Back to servers

MiniRAG

A retrieval-augmented generation server that provides both naive vector searching and graph-based retrieval algorithms to improve accuracy for small language models.

Stars
1
Updated
Nov 28, 2025

MINIRAG-MCP

An MCP Server wrapper around MiniRAG that leverages LLM sampling for client-managed inference.

Core function

MiniRag is an MCP project geared towards efficient and reliable retrieval by small language models). Minirag-MCP makes MiniRAG easier to plug into on-device agent flows.

Additional bells and whistles

  • UV management of dependencies that avoid complications in minirag setup
  • LLM sampling (inference calls managed by MCP client) with built-in constrained decoding for enhanced reliability

Feature roadmap

  • RAG retrieval
    • naive retrieval algorithm, implementing a basic query of the vector database
    • minirag algorithm, which operates on graph
  • basic insertion (and reindexing)
  • advanced insertion of content from the web

Quick Start

uv run server.py

Reviews

No reviews yet

Sign in to write a review