What is PrismRAG and how is it different from Graph RAG?

Standard Graph RAG derives relationships from document co-occurrence statistics — the documents decide which concepts are related. PrismRAG inverts this: you define a mapping (categories + word rules), and all data is embedded into your model of the world. This gives every client a completely different knowledge graph from the same documents, with full audit traceability from mapping rule to retrieval result.

What ML algorithms does PrismRAG use?

PrismRAG uses four core ML components: (1) Gemini text-embedding-004 for 768-dimensional semantic embeddings; (2) Tier-1 RulesStrategy: deterministic 768→256-d linear projection with 30% category one-hot signal + 70% random orthonormal projection, L2-normalised; (3) Tier-2 MLPStrategy: a 3-layer PyTorch MLP (768→512→512→256) trained with InfoNCE contrastive loss plus anchor repulsion, using AdamW + CosineAnnealingLR; (4) Louvain community detection on the word graph, with parallel Gemini LLM labeling of each community.

What is the Deliberation product and how does it work?

Deliberation is a three-phase AI reasoning pipeline for complex questions that span multiple domains. Phase 1 (Horizontal): LLM + knowledge base community search discovers the top 7 most relevant domains. Phase 2 (Vertical): all 7 domains are queried in parallel as domain experts, each getting a targeted deep prompt with optional KB grounding. Phase 3 (Synthesis): a Master Deliberator compares all findings, identifies agreements, surfaces conflicts between domain logic, flags unique insights, and produces a final synthesised answer. Unlike single-model RAG, genuine conflicts emerge from independent domain reasoning.

Does PrismRAG support MCP (Model Context Protocol) for AI agents?

Yes. PrismRAG ships a built-in MCP server exposing 9 tools: search, list_workspaces, list_communities, get_job_status, submit_job, create_bridge, deliberate, get_deliberation_session, and deliberation_followup. AI assistants like Claude, ChatGPT, and custom agents can connect via stdio or HTTP transport.

How is pricing structured?

PrismRAG (knowledge graph): Free $0 (5,000 chunks), Starter $49/mo (50,000 chunks), Professional $199/mo (500,000 chunks + MLP training + bridge vectors), Enterprise custom. Deliberation: Free $0 (5 deliberations), Starter $29/mo (50 deliberations), Professional $99/mo (500 deliberations), Enterprise custom. Overage: $0.25 per deliberation. Both products are available together on all paid plans.

Enterprise AI platform

Stop letting documents
define your knowledge graph.

PrismRAG maps your data into your semantic model — auditable rules, personal ML, millisecond search. Deliberation runs seven domain experts in parallel, then synthesises agreement, conflict, and insight.

Start free — no card required Explore products

Knowledge Graph Deliberation

# Your mapping — not document statistics
POST /api/prismrag/jobs

{
  "strategy": "mlp",
  "mapping": {
    "categories": [
      { "slug": "risk",   "label": "Risk & Compliance" },
      { "slug": "growth", "label": "Growth" }
    ],
    "rules": [
      { "word": "volatility", "category_slug": "risk" }
    ]
  }
}

Knowledge graph with your categories at the center

API-first

REST + webhooks

9 MCP tools

Agent-native

<50ms

HNSW vector search

7 experts

Parallel deliberation

100%

Audit traceability

Two Products

Knowledge you control. Reasoning you can trust.

One platform for building enterprise knowledge graphs and running multi-domain AI deliberation — included on every plan.

PrismRAG

Define which concepts belong together. Train a personal MLP on your rules. Search a graph built from your domain expertise — not document co-occurrence.

Custom mapping PyTorch MLP Graph RAG

See how it works →

Deliberation

Ask one hard question. Seven domain experts answer in parallel. A Master Deliberator surfaces agreements, real conflicts, and insights no single model would find.

7 parallel experts Conflict detection MCP-native

See the pipeline →

Why PrismRAG

Your team should own the graph — not the corpus

Graph RAG infers relationships from how often words appear together. PrismRAG embeds data into the structure you define.

Standard Graph RAG

PrismRAG

✗ Standard Graph RAG

Relationships extracted from document statistics
"bank" near "river bank" in old docs → wrong cluster for finance
Different clients get structurally the same graph
Wrong edge? Re-index all documents
LLM hallucinations baked permanently into graph
Black-box community assignments — not auditable

Data → Relationships

✓ PrismRAG

You define which words belong to which community
"bank" → finance cluster because your mapping says so
Per-client isolation — same docs, different graphs
Wrong edge? Update one rule row, re-run job
Every assignment traces to a versioned rule
Full audit trail from mapping rule → chunk → retrieval

Mapping → Data embedded into it

How PrismRAG Works

From your rules to intelligent retrieval

Three steps. Full audit trail. No black-box graph construction.

⬡

Define your mapping

Send a category table and word→group rules via API, CSV, SQL query, or Excel file. This is your domain model — explicit, versioned, and auditable.

Tier 1 · Rules

→

◈

We re-map the space

Your rules train a personal MLP (Tier 2) that learns the boundaries of your categories and generalises them to unseen vocabulary. All source text is projected through your model.

Tier 2 · MLP Training

→

◎

Retrieval reflects your intent

Queries traverse your community graph, not a statistical one. Graph RAG finds what you defined as related — not what happened to co-occur in training data.

Graph RAG · HNSW

Under the Hood

Enterprise ML, not a prompt wrapper

Every API call is backed by a multi-layer machine learning stack. Here's exactly what runs and when.

All ingestion APIs

Gemini text-embedding-004

Every word and chunk is embedded into 768-dimensional semantic space using Google's Gemini embedding model, with DB caching so repeated words don't re-call the API.

768-d vectors DB cache Batch calls

strategy: "rules" (Tier 1)

Linear Projection + Category Signal

768-d Gemini vectors are projected to 256-d via a seeded random orthonormal matrix (preserves cosine distances). A 30% category one-hot signal is blended in to pull same-category words together before MLP training. L2-normalised to unit sphere. Fully deterministic and auditable.

768→256-d 30/70 blend L2 normalise QR orthonormal

strategy: "mlp" (Tier 2)

Personal PyTorch MLP

A 3-layer MLP (768→512→512→256) is trained on your word rules using InfoNCE contrastive loss (τ=0.20) plus anchor repulsion (weight=0.35). This learns the decision boundaries between your categories and generalises them to vocabulary not in your rule set. AdamW optimiser, CosineAnnealingLR schedule, gradient clipping at 1.0.

PyTorch InfoNCE loss Anchor repulsion AdamW Cosine LR

After every ingestion job

Louvain Community Detection

A word co-occurrence graph is built from ingested chunks, then Louvain modularity optimisation groups words into communities. Each community centroid is computed, stored as a pgvector VECTOR(256), and labelled in parallel using Gemini LLM calls.

python-louvain NetworkX Parallel LLM labelling

POST /api/prismrag/search

Graph RAG Retrieval Pipeline

Three-phase retrieval: (1) HNSW cosine search on community centroids to find the closest community to the query. (2) BFS 2-hop expansion on the word graph to find related words. (3) MLP re-ranking: if a Tier-2 model exists, all candidates are re-projected through the personal MLP before final cosine ranking.

HNSW m=16 BFS 2-hop MLP re-rank pgvector

POST /api/prismrag/bridges

Bridge Vector Synthesis

Cross-community semantic bridges are created by computing the normalised midpoint between two community centroids: normalize((centroid_A + centroid_B) / 2). The bridge vector is then projected through the personal MLP (if Tier 2), creating a synthetic node that connects two previously separate semantic clusters without retraining.

Centroid midpoint MLP projection No retraining

POST /api/deliberation/sessions

Multi-Agent Deliberation (LLM ensemble)

The Deliberation pipeline runs a structured ensemble of Gemini LLM calls: one horizontal domain discovery call, up to 10 parallel vertical expert calls (each with domain-specific prompt engineering), and one Master Deliberator synthesis call. Based on the Mixture of Agents (MoA) technique — diverse independent reasoning outperforms a single call on complex questions.

LLM ensemble Parallel inference Conflict detection MoA technique

Every API endpoint

Three-Layer Metering

Usage is enforced and tracked at three levels: Redis sliding window for per-minute rate limiting (<1ms), Redis monthly counter for quota tracking (Postgres fallback), and async Postgres writes for usage_event audit log. Zero DB latency added to the request path.

Redis sliding window Monthly quota Async audit log

Deliberation

MCP-native · API-first · KB-grounded

Seven experts. One synthesised answer.

Complex decisions span finance, legal, ops, and more. Deliberation discovers relevant domains, queries them in parallel, and surfaces real disagreements — not hedged single-model prose.

Deliberation pipeline: question to domain experts to synthesis

Phase 1

Horizontal Discovery

LLM + your knowledge base community search surfaces the top 7 most relevant domains for the question. Domains from both sources get a 15% score boost (source="hybrid").

~2–5 seconds

→

Phase 2

Vertical Expert Queries

All 7 domains are queried in parallel. Each domain gets a targeted deep prompt as a domain specialist, with optional KB search injecting your organisation's documents per domain.

~8–20 seconds

→

Phase 3

Master Deliberator

A Master Deliberator compares all domain findings. Surfaces agreements, flags genuine conflicts between domain logic, extracts unique insights, and synthesises a final weighted answer.

~5–12 seconds

✗ Standard single-model RAG

The same model generates both sides of "on one hand… on the other hand…" — so it tends toward vague consensus. All domains compete for the same attention budget. There's no real conflict detection because the model doesn't want to contradict itself. You can't see which domain contributed what.

✓ PrismRAG Deliberation

Finance, Antitrust, and HR each reason independently before any synthesis. Real conflicts emerge from domain logic differences, not hedging. The Master Deliberator can say "Finance says go, Antitrust assigns 40% block probability" — a genuine disagreement from independent reasoning. Every domain's contribution is visible and auditable.

Who uses Deliberation

Real questions that need real multi-domain answers

⚖

M&A Analysis

Finance, Antitrust, HR, Technology Integration, Market Strategy, Regulatory, and Culture domains each weigh in — then conflicts are surfaced.

Replaces $50K–$500K consulting engagement

Medical Differential

Cardiology, Neurology, Endocrinology, and Psychiatry all examine the same symptom set independently before synthesis.

Replaces $500–$3,000 specialist consultation

◆

Regulatory Compliance

GDPR, CCPA, common law, and sector regulations examined in parallel. Conflicts between frameworks are explicitly surfaced.

Replaces $2,000–$10,000 legal research memo

◈

Market Entry Strategy

Market Entry, Regulatory, Cultural, Supply Chain, Competitive, Financial, and Political Risk — all examined automatically.

Replaces $50,000–$200,000 market research report

◉

Insurance Risk Assessment

Actuarial, Legal, Environmental, Behavioral, and Operational domains assess the same risk from different expert angles.

Replaces $5,000–$50,000 risk assessment

⚙

AI Agent Reasoning

Connect via MCP. Your Claude/ChatGPT agent calls deliberate(question) and gets a full multi-domain synthesis automatically.

MCP-native: no custom orchestration needed

MCP Integration

AI agents can deliberate in one tool call

# Claude Desktop / any MCP-compatible AI agent
# Just one tool call — the full 3-phase pipeline runs automatically

deliberate({
  "question":     "Should we acquire CompetitorX in Q4?",
  "domain_count": 7,
  "tenant_id":    "your-prismrag-workspace"  // optional KB grounding
})

// Returns:
{
  "agreements":      "Finance & Strategy both see 12–18% synergy...",
  "conflicts":       "Finance: 70% deal probability. Antitrust: 40%.",
  "unique_insights": "HR flagged culture clash not in financial models",
  "final_answer":    "Strong strategic fit, material regulatory risk...",
  "confidence":      0.81
}

PrismRAG Features

Enterprise-grade from day one

⬡

Any input source

SQL query, Excel upload, REST API, or existing pgvector chunk store. One job API handles all of them.

◈

Tier 1 + Tier 2

Start with explicit rules (fully auditable). Upgrade to MLP training that generalises your rules to new vocabulary without rewriting them.

◎

HNSW vector indexes

Every vector table ships with HNSW indexes (m=16, ef_construction=64). Millisecond cosine search across millions of chunks — not sequential scans.

⬡

Bridge vectors

Connect two existing communities post-hoc with a synthetic bridge node — no retraining required. Instantly links semantic clusters across domains.

◈

Per-client isolation

Every tenant gets their own vector space, their own community graph, their own MLP model weights. Complete data isolation by design.

◎

Full audit trail

Every retrieval decision traces back to a versioned mapping rule. Regulators can ask "why did this document appear?" and get a real answer.

⬡

Async large file ingestion

Files under 1MB run inline. Up to 500MB upload directly to Azure Blob, queue via Service Bus, processed streaming row-by-row — never held in memory.

◈

MCP server built in

9 MCP tools out of the box: search, submit_job, create_bridge, deliberate, and more. Connect any Claude, ChatGPT, or custom AI agent in minutes.

Pricing

Simple, transparent pricing

Both products included in every plan. Start free — no credit card required.

⬡ PrismRAG — Knowledge Graph

Free

$0/month

Try PrismRAG with no commitment

✓ 5,000 chunks / month
✓ 1 workspace
✓ CSV / Excel ingestion
✓ Graph RAG retrieval
– Tier-2 MLP training
– Bridge vectors
– SQL / API ingestion

Start free

Starter

$49/month

For teams building their first pipeline

✓ 50,000 chunks / month
✓ 1 workspace · 3 mappings
✓ CSV / Excel / API ingestion
✓ Graph RAG retrieval
– Tier-2 MLP training
– Bridge vectors
✓ Email support

Get started

◈ Deliberation — Multi-Domain AI Synthesis

Included with every PrismRAG plan. Deliberation credits are separate from chunk limits.

Free

$0/month

5 deliberations / month

Up to 7 domains per session
Sync + async mode
Follow-up questions (free)
REST API access

Start free

Starter

$29/month

50 deliberations / month

Up to 7 domains per session
Sync + async mode
Follow-up questions (free)
$0.25 per overage deliberation

Get started

Professional

$99/month

500 deliberations / month

Up to 10 domains per session
KB-grounded per domain
MCP tools included
$0.25 per overage deliberation

Start Professional

Enterprise

Custom

Unlimited deliberations

Unlimited domains
Custom domain taxonomy
Private MCP endpoint
Dedicated support

Contact sales

💡

Bundle both products

Professional PrismRAG ($199/mo) + Professional Deliberation ($99/mo) = $298/mo for the full enterprise knowledge + reasoning platform. Contact sales for bundle discounts on annual plans.

Ready to own your knowledge graph?

5,000 free chunks and 5 free deliberations every month. No credit card. Full API access from day one.

Start building free Talk to sales

Stop letting documents define your knowledge graph.

Knowledge you control. Reasoning you can trust.

PrismRAG

Deliberation

Your team should own the graph — not the corpus

From your rules to intelligent retrieval

Define your mapping

We re-map the space

Retrieval reflects your intent

Enterprise ML, not a prompt wrapper

Gemini text-embedding-004

Linear Projection + Category Signal

Personal PyTorch MLP

Louvain Community Detection

Graph RAG Retrieval Pipeline

Bridge Vector Synthesis

Multi-Agent Deliberation (LLM ensemble)

Three-Layer Metering

Seven experts. One synthesised answer.

✗ Standard single-model RAG

✓ PrismRAG Deliberation

Real questions that need real multi-domain answers

M&A Analysis

Medical Differential

Regulatory Compliance

Market Entry Strategy

Insurance Risk Assessment

AI Agent Reasoning

AI agents can deliberate in one tool call

Enterprise-grade from day one

Any input source

Tier 1 + Tier 2

HNSW vector indexes

Bridge vectors

Per-client isolation

Full audit trail

Async large file ingestion

MCP server built in

Simple, transparent pricing

⬡ PrismRAG — Knowledge Graph

◈ Deliberation — Multi-Domain AI Synthesis

Ready to own your knowledge graph?

Stop letting documents
define your knowledge graph.