Private multi-tenant RAG. Two API calls.
Add document intelligence to your product or your organization in 72 hours. Every tenant — every customer, every department — gets a fully isolated knowledge base. Hosted on our infrastructure or deployed inside your own cloud.
Building RAG in-house costs more than it looks.
Every team underestimates it. The chunking is easy. Multi-tenancy, re-ranking, latency at scale, embedding drift, security audits — that's where the 6 months go.
2 engineers × 6 months to build + maintain
before your first customer can search a document
maintenance, upgrades, security patches
Formulate RAG: flat monthly license. Live in 72 hours. No engineering headcount. No maintenance burden. Scale from 10 tenants to 10,000 without changing a line of code.
Document in. Answer out. Every tenant isolated.
Two customer types. Two deployment models.
You have customers. They need their own knowledge base.
Your customers want document search, Q&A on their own data, or a private knowledge base — inside your product. Building multi-tenant RAG takes 2 engineers × 6 months.
RAG becomes a silent feature inside your product. Each customer gets a fully isolated knowledge base. You deploy in your own cloud, data never leaves your VPC. Your customers never know it's Formulate.
You have departments. Each one needs its own RAG.
One org, many teams — HR, Legal, Finance, Clinical. Each needs to search their own documents without seeing each other's data. No DevOps team to maintain infrastructure.
Call our hosted API. Each department gets a fully isolated namespace. No infra work, no DevOps. Pass the tenant ID in the header — isolation is handled at the database layer.
Built for regulated industries from day one.
Every enterprise RAG deployment runs a compliance gauntlet — DPDP, HIPAA, RBI, SEBI, SOC2. Formulate RAG was designed with these requirements baked in, not bolted on.
Data Principal and Fiduciary obligations met. Data stays within India on BYOC deployments.
Healthcare-grade data handling. BAA available. Air-gapped deployment for regulated clinical data.
Financial data never co-mingled across tenants. Native isolation satisfies regulatory audit requirements.
Evidence collection automated. Audit-ready posture. Security controls documented and tested.
BYOC: retrieval never leaves your cloud. Generation via Bedrock or Azure OpenAI stays in your account.
Native isolation at the pgvector layer — not a filter, not access control. Architecturally impossible for tenants to cross-query.
True data sovereignty in BYOC mode: Retrieval never leaves your cloud. For generation, use AWS Bedrock or Azure OpenAI — both keep LLM calls within your cloud account. Bringing your own OpenAI key routes generation through OpenAI's servers — not ours.
Two API calls. That's the whole integration.
POST to /ingest to upload documents. POST to /search to retrieve. Pass your tenant ID in the header. Done.
POST /ingest
X-Tenant-ID: acme-corp
Content-Type: multipart/form-data
file: product_docs_v3.pdf
→ 200 OK
{
"job_id": "ing_01jwx4k",
"status": "processing",
"webhook": "notified on completion"
}POST /search
X-Tenant-ID: acme-corp
Content-Type: application/json
{
"query": "What is the return policy for enterprise plans?",
"top_k": 5
}
→ 200 OK (<300ms)
{
"results": [
{
"content": "Enterprise plans include a 30-day...",
"score": 0.94,
"source": "product_docs_v3.pdf"
}
]
}Your cloud or ours. Your choice.
We host. You call.
Formulate AI runs the full RAG stack on our infrastructure. You make two API calls — /ingest and /search. We handle availability, scaling, and ops.
- No infra work on your side
- SLA-backed 99.5% uptime
- Live in 72 hours
- Monthly flat pricing
Best for enterprises getting started fast
Your cloud. Our software.
Deploy Formulate RAG inside your own AWS, Azure, or GCP account. You own the data, the infra, and the compute. We provide the software and the deployment package.
- Data never leaves your VPC
- Bedrock / Azure OpenAI for generation
- CloudFormation + Helm chart included
- Software license — you pay cloud bill
Best for platforms with data sovereignty requirements
Cloud RAG wasn't built for multi-tenancy.
| Feature | Formulate RAG | AWS Bedrock | Google Vertex |
|---|---|---|---|
| Multi-tenant out of the box | ✕ | ✕ | |
| Tenant isolation (native) | ✕ | ✕ | |
| Deploy in your cloud (BYOC) | ✕ | ✕ | |
| LLM agnostic (any provider) | ✕ | ✕ | |
| Flat pricing (not per KB) | ✕ | ✕ | |
| Live in 72 hours | ✕ | ✕ | |
| No infra work required |
Flat pricing. No per-query surprises.
One-time setup fee $2,500 covers integration support, developer onboarding, and 30-day hypercare. Annual billing saves 15%.
- REST API (ingest + search)
- Tenant-isolated namespaces
- PDF, DOCX, TXT, CSV, HTML
- Async ingestion + webhooks
- Email support
- Everything in Starter
- Per-tenant usage dashboard
- Query logs + analytics
- Slack support channel
- Response < 4 hours
- Everything in Growth
- Dedicated onboarding engineer
- 30-day hypercare
- 99.5% uptime SLA
- Annual: $40,800 (15% off)
- Deploy in your AWS/Azure/GCP
- Bedrock / Azure OpenAI ready
- CloudFormation / Helm chart
- Full data sovereignty
- One-time setup: $3,000
Pilot offer: 3 months at $2,000/mo.
Up to 100 tenants. No commitment for full rollout. Prove it works in your product or org, then expand.
contact@formulateai.in · Built in Bangalore · Data stays yours