LimitedWe'll build your AI MVP. Free. — 3-month Vibe CTO after delivery.Apply →
Formulate RAG

Private multi-tenant RAG. Two API calls.

Add document intelligence to your product in 72 hours. Tenant-isolated, data-sovereign RAG — hosted on our infrastructure or deployed in your own cloud.

<300ms
P99 retrieval latency
2
API calls to integrate
72h
from agreement to live
tenants, one deployment

Two customer types. Two models.

B2B2B · Platforms

You have customers.
They need RAG.

RAG becomes a silent feature inside your product. Your customers use it — they never know it's Formulate. You deploy in your own cloud, data stays in your VPC.

e.g. messaging platforms, CRM tools, support apps, chatbot products

Deploy in your cloud (BYOC) — $5,500/mo
B2B · Enterprises

You have departments.
They need RAG.

One org, many teams. HR, Legal, Finance, Clinical — each gets their own isolated RAG namespace. Call our hosted API. No infra work, no DevOps.

e.g. hospitals, banks, insurance firms, universities, conglomerates

Hosted API — from $800/mo

Two API calls. That's the whole integration.

POST to /ingest to upload documents. POST to /search to retrieve. Pass your tenant ID in the header. Done.

Step 1 — Ingest documents
POST /ingest
X-Tenant-ID: acme-corp
Content-Type: multipart/form-data

file: product_docs_v3.pdf

→ 200 OK
{
  "job_id": "ing_01jwx4k",
  "status": "processing",
  "webhook": "notified on completion"
}
Step 2 — Search at query time
POST /search
X-Tenant-ID: acme-corp
Content-Type: application/json

{
  "query": "What is the return policy for enterprise plans?",
  "top_k": 5
}

→ 200 OK  (<300ms)
{
  "results": [
    {
      "content": "Enterprise plans include a 30-day...",
      "score": 0.94,
      "source": "product_docs_v3.pdf"
    }
  ]
}
PDF · DOCX · TXT · CSV · HTMLAsync ingestion + webhooksSemantic search (pgvector)Hybrid BM25 + vectorTop-k with scores + sourcesLLM-agnostic (returns chunks)

Your cloud or ours.

Hosted API

We host. You call.

Formulate AI runs the full RAG stack on our infrastructure. You make two API calls — /ingest and /search. We handle availability, scaling, and ops.

  • No infra work on your side
  • SLA-backed 99.5% uptime
  • Live in 72 hours
  • Monthly flat pricing

Best for enterprises and teams getting started fast

Your Cloud (BYOC)

Your cloud. Our software.

Deploy Formulate Agent inside your own AWS, Azure, or GCP account. You own the data, the infra, and the compute. We provide the software and the deployment package.

  • Data never leaves your VPC
  • Bedrock / Azure OpenAI for generation
  • CloudFormation + Helm chart included
  • Software license — you pay cloud bill

Best for platforms with data sovereignty requirements

True data sovereignty note: Retrieval never leaves your cloud in BYOC mode. For generation, use AWS Bedrock or Azure OpenAI — both keep LLM calls within your cloud account. Bringing your own OpenAI key routes generation through OpenAI's servers.

Cloud RAG wasn't built for multi-tenancy.

FeatureFormulate RAGAWS BedrockGoogle Vertex
Multi-tenant out of the box
Tenant isolation (native)
Deploy in your cloud (BYOC)
LLM agnostic (any provider)
Flat pricing (not per KB)
Live in 72 hours
No infra work required

Build in-house comparison: 2 backend engineers × 6 months = $150k–$200k. Ongoing maintenance $50k/year. Cloud RAG services charge per knowledge base — costs compound fast with multi-tenancy. Formulate RAG is a flat monthly license.

Flat pricing. No per-query surprises.

One-time setup fee $2,500 covers integration support, developer onboarding, and 30-day hypercare. Annual billing saves 15%.

Starter
$800/ month
Up to 100 tenants
$8 / tenant
  • REST API (ingest + search)
  • Tenant-isolated namespaces
  • PDF, DOCX, TXT, CSV, HTML
  • Async ingestion + webhooks
  • Email support
Get started
Growth
$2,500/ month
Up to 500 tenants
$5 / tenant
  • Everything in Starter
  • Per-tenant usage dashboard
  • Query logs + analytics
  • Slack support channel
  • Response < 4 hours
Get started
Scale
$4,000/ month
Up to 1,000 tenants
$4 / tenant
  • Everything in Growth
  • Dedicated onboarding engineer
  • 30-day hypercare
  • 99.5% uptime SLA
  • Annual: $40,800 (15% off)
Get started
BYOC
$5,500/ month
Unlimited tenants
You pay your cloud bill
  • Deploy in your AWS/Azure/GCP
  • Bedrock / Azure OpenAI ready
  • CloudFormation / Helm chart
  • Full data sovereignty
  • One-time setup: $3,000
Get started

Pilot offer: 3 months at $2,000/mo.

Up to 100 tenants. No commitment for full rollout. Prove it works in your product, then expand.

contact@formulateai.in · Built in Bangalore · Data stays yours