LimitedWe'll build your AI MVP. Free. — 3-month Vibe CTO after delivery.Apply →
Formulate RAG

Private multi-tenant RAG. Two API calls.

Add document intelligence to your product or your organization in 72 hours. Every tenant — every customer, every department — gets a fully isolated knowledge base. Hosted on our infrastructure or deployed inside your own cloud.

<300ms
P99 retrieval latency
2
API calls to integrate
72h
from agreement to live
tenants, one deployment
The build-vs-buy reality

Building RAG in-house costs more than it looks.

Every team underestimates it. The chunking is easy. Multi-tenancy, re-ranking, latency at scale, embedding drift, security audits — that's where the 6 months go.

Engineering cost
$150–200K

2 engineers × 6 months to build + maintain

Time to market
6 months

before your first customer can search a document

Ongoing cost
$50K+/yr

maintenance, upgrades, security patches

Formulate RAG: flat monthly license. Live in 72 hours. No engineering headcount. No maintenance burden. Scale from 10 tenants to 10,000 without changing a line of code.

Document in. Answer out. Every tenant isolated.

📄
Documents
PDF · DOCX · TXT · CSV
/ingest
POST with tenant ID
🗄
Nyas
pgvector · S3
🔍
/search
<300ms P99
Results
chunks + scores + source
Each tenant gets a fully isolated namespace
Tenant AAcme Corp
📄policy.pdf
📄handbook.docx
ISOLATED NAMESPACE
Tenant BBeta Inc
📄contracts.pdf
📄sops.txt
ISOLATED NAMESPACE
Tenant CGamma Ltd
📄specs.pdf
📄wiki.html
ISOLATED NAMESPACE
🔒Tenant A's searches never touch Tenant B's data — isolated at the pgvector layer, not via filters.

Two customer types. Two deployment models.

B2B2CSaaS Platforms

You have customers. They need their own knowledge base.

The problem

Your customers want document search, Q&A on their own data, or a private knowledge base — inside your product. Building multi-tenant RAG takes 2 engineers × 6 months.

What you get

RAG becomes a silent feature inside your product. Each customer gets a fully isolated knowledge base. You deploy in your own cloud, data never leaves your VPC. Your customers never know it's Formulate.

Used by
Messaging platformsCRM toolsSupport softwareChatbot productsDocument management
BYOC — $5,500/mo · Unlimited tenants
B2BEnterprises

You have departments. Each one needs its own RAG.

The problem

One org, many teams — HR, Legal, Finance, Clinical. Each needs to search their own documents without seeing each other's data. No DevOps team to maintain infrastructure.

What you get

Call our hosted API. Each department gets a fully isolated namespace. No infra work, no DevOps. Pass the tenant ID in the header — isolation is handled at the database layer.

Used by
Hospitals & pharmaBanks & NBFCsInsurance firmsUniversitiesLarge conglomerates
Hosted API — from $800/mo

Built for regulated industries from day one.

Every enterprise RAG deployment runs a compliance gauntlet — DPDP, HIPAA, RBI, SEBI, SOC2. Formulate RAG was designed with these requirements baked in, not bolted on.

India DPDP Act

Data Principal and Fiduciary obligations met. Data stays within India on BYOC deployments.

HIPAA Ready

Healthcare-grade data handling. BAA available. Air-gapped deployment for regulated clinical data.

RBI / SEBI Guidelines

Financial data never co-mingled across tenants. Native isolation satisfies regulatory audit requirements.

SOC 2 Type II

Evidence collection automated. Audit-ready posture. Security controls documented and tested.

Data Sovereignty

BYOC: retrieval never leaves your cloud. Generation via Bedrock or Azure OpenAI stays in your account.

Tenant Isolation

Native isolation at the pgvector layer — not a filter, not access control. Architecturally impossible for tenants to cross-query.

True data sovereignty in BYOC mode: Retrieval never leaves your cloud. For generation, use AWS Bedrock or Azure OpenAI — both keep LLM calls within your cloud account. Bringing your own OpenAI key routes generation through OpenAI's servers — not ours.

Two API calls. That's the whole integration.

POST to /ingest to upload documents. POST to /search to retrieve. Pass your tenant ID in the header. Done.

Step 1 — Ingest documents
POST /ingest
X-Tenant-ID: acme-corp
Content-Type: multipart/form-data

file: product_docs_v3.pdf

→ 200 OK
{
  "job_id": "ing_01jwx4k",
  "status": "processing",
  "webhook": "notified on completion"
}
Step 2 — Search at query time
POST /search
X-Tenant-ID: acme-corp
Content-Type: application/json

{
  "query": "What is the return policy for enterprise plans?",
  "top_k": 5
}

→ 200 OK  (<300ms)
{
  "results": [
    {
      "content": "Enterprise plans include a 30-day...",
      "score": 0.94,
      "source": "product_docs_v3.pdf"
    }
  ]
}
PDF · DOCX · TXT · CSV · HTMLAsync ingestion + webhooksSemantic search (pgvector)Hybrid BM25 + vectorTop-k with scores + sourcesLLM-agnostic (returns chunks)

Your cloud or ours. Your choice.

Hosted API

We host. You call.

Formulate AI runs the full RAG stack on our infrastructure. You make two API calls — /ingest and /search. We handle availability, scaling, and ops.

  • No infra work on your side
  • SLA-backed 99.5% uptime
  • Live in 72 hours
  • Monthly flat pricing

Best for enterprises getting started fast

Your Cloud (BYOC)

Your cloud. Our software.

Deploy Formulate RAG inside your own AWS, Azure, or GCP account. You own the data, the infra, and the compute. We provide the software and the deployment package.

  • Data never leaves your VPC
  • Bedrock / Azure OpenAI for generation
  • CloudFormation + Helm chart included
  • Software license — you pay cloud bill

Best for platforms with data sovereignty requirements

Cloud RAG wasn't built for multi-tenancy.

FeatureFormulate RAGAWS BedrockGoogle Vertex
Multi-tenant out of the box
Tenant isolation (native)
Deploy in your cloud (BYOC)
LLM agnostic (any provider)
Flat pricing (not per KB)
Live in 72 hours
No infra work required

Flat pricing. No per-query surprises.

One-time setup fee $2,500 covers integration support, developer onboarding, and 30-day hypercare. Annual billing saves 15%.

Starter
$800/ month
Up to 100 tenants
$8 / tenant
  • REST API (ingest + search)
  • Tenant-isolated namespaces
  • PDF, DOCX, TXT, CSV, HTML
  • Async ingestion + webhooks
  • Email support
Get started
Growth
$2,500/ month
Up to 500 tenants
$5 / tenant
  • Everything in Starter
  • Per-tenant usage dashboard
  • Query logs + analytics
  • Slack support channel
  • Response < 4 hours
Get started
Scale
$4,000/ month
Up to 1,000 tenants
$4 / tenant
  • Everything in Growth
  • Dedicated onboarding engineer
  • 30-day hypercare
  • 99.5% uptime SLA
  • Annual: $40,800 (15% off)
Get started
BYOC
$5,500/ month
Unlimited tenants
You pay your cloud bill
  • Deploy in your AWS/Azure/GCP
  • Bedrock / Azure OpenAI ready
  • CloudFormation / Helm chart
  • Full data sovereignty
  • One-time setup: $3,000
Get started

Pilot offer: 3 months at $2,000/mo.

Up to 100 tenants. No commitment for full rollout. Prove it works in your product or org, then expand.

contact@formulateai.in · Built in Bangalore · Data stays yours