Services
Engagements that ship, priced to the outcome
Start with a fast, measurable win and grow into the platform. Every engagement leaves you owning working software, runbooks, and the instrumentation to keep improving without us.
Tier 1 — Direct fits
Fast, measurable engagements that open the door to bigger builds.
LLM Cost & Performance Optimization
→Most teams running production AI pay 3–10x what they should. We audit prompts, model routing, caching, batching, and fallback chains, then ship cost-aware routing and observability.
- Outcome
- 30–60% LLM cost reduction in 4 weeks, documented
- Timeline
- 4–6 weeks
- Pricing
- $15–40k, or outcome-priced (25% of first-year savings)
Agentic Codebase Readiness Audit
→We map your codebase against what actually makes it AI-coding-ready — module boundaries, test coverage, type strictness, docs, CLAUDE.md / rules files, and MCP potential — and quantify how far off you are.
- Outcome
- A scored report + prioritized remediation roadmap
- Timeline
- 2–3 weeks
- Pricing
- $10–25k audit (often leads to remediation)
Internal AI Coding Workflow Build-Out
→We standardize how your team uses Claude Code / Cursor / Copilot — custom slash commands, agent definitions, MCP servers for your internal tools, golden-path templates, and code-review hooks.
- Outcome
- A working internal AI dev platform your team adopts
- Timeline
- 6–10 weeks
- Pricing
- $40–120k build, $5–10k/mo tuning
Production AI Eval Infrastructure
→Most teams shipped AI features with zero evals. We build eval harnesses, regression suites, online quality monitoring, and A/B infra for prompts and models.
- Outcome
- An eval platform wired into your CI/CD
- Timeline
- 4–8 weeks
- Pricing
- $30–80k build + $3–8k/mo ops
MCP Server Builds for Internal Tooling
→We expose your internal systems — Datadog, Linear, databases, deploy tooling — to AI agents over MCP, so your devs can work against company infra safely.
- Outcome
- Internal MCP servers with auth, access control, audit logs
- Timeline
- 3–6 weeks
- Pricing
- $25–75k per build, $3–8k/mo maintenance
Tier 2 — Platform builds
Larger engagements with enterprise-tier pricing and longer cycles.
Agentic Development Platform Build
→A brat-style multi-agent harness tuned to your stack. Your devs delegate tasks — refactor a module, add tests, fix a class of bugs across the codebase — with safety rails and human-in-the-loop.
- Outcome
- A multi-agent harness running against your repo
- Timeline
- 3–6 months
- Pricing
- $100–300k build, $10–25k/mo ops
Migration: Legacy Codebase to AI-Ready
→We restructure a messy monorepo so agents work well in it — module boundaries, type coverage, doc generation, test scaffolding, and spec generation from existing code.
- Outcome
- A phased restructure that agents can work in
- Timeline
- 3–6 months
- Pricing
- $150–500k+
Tier 3 — Specialized niches
High-margin work for regulated and data-sensitive teams.
AI Code Security & Supply Chain
→We audit AI-generated code for vulnerabilities, license violations, prompt-injection vectors, and data leakage, then set up policy and automated scanning in your CI.
- Outcome
- A security report + scanning infra in CI
- Timeline
- 3–5 weeks
- Pricing
- $20–50k audit, $5–12k/mo monitoring
Embeddings + Search Modernization
→We replace or augment legacy search (Elasticsearch, Algolia) with semantic search and RAG — often starting as a small AI feature and growing into a retrieval platform.
- Outcome
- A new semantic search / retrieval layer in production
- Timeline
- 6–12 weeks
- Pricing
- $40–120k build + $4–10k/mo ops
Self-Hosted LLM Infrastructure
→For data-sensitive teams (healthcare, finance, defense, EU): local inference, RAG pipelines, and fine-tuning workflows on infrastructure you control.
- Outcome
- A working private AI stack on your cloud or on-prem
- Timeline
- 8–16 weeks
- Pricing
- $75–250k build, $10–20k/mo ops
Tier 4 — Productized retainers
Predictable revenue, deeper relationships.
Not sure where to start?
Most teams start with the LLM cost audit — it pays for itself fastest and tells us both whether there’s a bigger build worth doing.