Productized engagement
Private AI Pilot.
Four weeks. One production use case.
A fixed-scope, fixed-timeline Private AI pilot. On-premise LLM, RAG with citations, eval suite, audit logging, and an internal walkthrough so your team owns it after I leave. Built for regulated sectors where the cloud is not an answer.
Timeline
4 weeks
Scope
1 use case
Pricing
From $15k
Support
30 days
The four weeks, in order.
Week 1
Discovery and architecture
- Map the use case end to end: users, data, decisions, latency budget
- Compliance and residency review (PDPL, DIFC, GDPR as applicable)
- Model and infra selection. Sign off on architecture document
Week 2
Stand up the stack
- Deploy the model serving layer (vLLM or Ollama) on your infra
- Build the RAG pipeline with chunking and citation surface
- Wire identity, role-based access, and audit logging
Week 3
Build and integrate
- Ship the production interface (chat, search, or agent UI)
- Integrate with the source of truth: docs, DB, internal APIs
- Add eval suite with a 100-row golden dataset and LLM-as-judge
Week 4
Harden, ship, hand off
- Cost guardrails, rate limits, and incident playbook
- Internal team enablement: 90-min walkthrough plus a recorded primer
- Production launch with a 30-day support window
What's in the box.
- On-premise LLM deployed on your infrastructure
- RAG pipeline with citation-first responses
- Eval suite with golden dataset and LLM-as-judge
- Identity, RBAC, and audit log to your SIEM
- Cost guardrails and rate limits
- Architecture document plus runbook
- Recorded internal walkthrough for your team
- 30-day post-launch support
Fit check.
The pilot ships best when these four things are true. If only some are true, a discovery call will sort whether the pilot or a different scope fits.
- You have a defined use case (not a research question)
- You have data the LLM should ground on
- You have a stakeholder who will use the result the day it ships
- You operate in a sector where data residency is non-negotiable
Reserve a pilot slot.
I take two pilots in flight at any time. Discovery call locks scope and start date.