The Approach

How we implement AI that stays running.

Most AI doesn't fail because the model was wrong. It fails because nobody adopts it. This is the operating model we use to build systems the team actually keeps using, and to keep them that way.

Why AI systems get built, then quietly abandoned

Most of the failed systems we get called in to look at didn't fail loudly. They launched, they worked in the demo, and a few weeks later the team had drifted back to the old spreadsheet. Nobody filed a complaint. They just stopped opening it.

Almost none of those failures were really about the model. They came down to whether people folded it into their actual day. A handful of patterns show up over and over:

  • It got something wrong once, in front of someone who mattered. That's usually all it takes. People will give a coworker who slips up another shot. They won't extend that to software, especially after it's embarrassed them with a customer on the line.
  • It made people work differently. A new tab, a new login, one more habit to remember. That's a hard trade against a spreadsheet someone has leaned on for years. The systems that stick are the ones that meet people inside the tools they already use.
  • It fixed the wrong problem. The demo looked sharp because it was pointed at something flashy, not at the part of the business that's genuinely slow or expensive. It got picked because someone was excited, not because it sat on top of real money.
  • Nobody asked the people who'd have to use it. A system the team didn't help shape tends to feel like it's there to watch them or replace them, so they route around it. That usually gets decided long before any of the technical questions do.
  • No one tracked what it was worth. If nobody can say what it saved or earned, it's the first line cut when budgets get tight. A system without an owner rarely survives its first lean quarter.

None of that is a model problem. It's an operating problem, and it's the one we're built to solve. Our four phases are organized around these specific failures, and we start working against them in the first week.

01

The four phases

A sequence, not a methodology theater. Each phase ends with a written artifact the team can act on.
PHASE 01 · DISCOVER
Map the system you actually have.

Workflows, tools, data, people, the outcome on the line. Two-week embed. Output: a written diagnosis.

PHASE 02 · DESIGN
Architect backwards from the outcome.

Model selection, orchestration, integrations, evals. Output: a system spec your engineering team can read.

PHASE 03 · DEPLOY
Ship into your stack, not a sandbox.

We build with your data, your auth, your identity. Output: a production system with monitoring on day one.

PHASE 04 · OPERATE
Stay engaged. Tune. Expand.

Retained partnership. Monthly business reviews on real outcomes. Output: a system that compounds in value.

Phase 01 · Discover

We don't start with the AI. We start with the workflow it's supposed to fit inside. A senior engineer plus an analyst embed for two weeks. They sit in the meetings. They watch the work. They read the ticket queue, the Slack channels, the support transcripts.

The output is a written current-state map: every system in scope, every integration that matters, every constraint that's load-bearing, and a single chart that shows where the time and the margin are actually going. We finish with a ranked list of what to build, and what to not build, framed against P&L impact.

What you can expect by the end of week two

  • A current-state diagram of the in-scope systems and data flow.
  • A short-list of three to five intervention opportunities, sized by impact and lift.
  • A recommendation: which to build first, and why.
  • An honest assessment of what's not worth doing yet.

Phase 02 · Design

Once we know what we're building, we spec it. This phase is short, usually one to two weeks, and produces a system architecture that engineering and operations can both read.

We pick the model, the orchestration framework, the vector store (or whether you even need one), the integration pattern, the identity and permissions story, and the evaluation harness. We design the operational interface, who will use it, how, and what failure looks like when it happens.

The cheapest time to change your mind is at the end of design. We spend disproportionate time here on purpose.

Phase 03 · Deploy

We build in your environment, with your data, behind your auth, and inside the tools your team already uses. We don't build in a sandbox and then promise it'll work on the other side, because a system the team has to leave their workflow to use is a system they'll quietly stop using.

Every system we ship comes with three guarantees from day one: structured logs you can grep, evals that run on every change, and a dashboard that shows operational health to whoever needs to see it. We hand over runbooks for the on-call rotation if there is one.

Typical deploy phase is six to fourteen weeks, depending on integration surface area. We work in two-week iterations, demoing into production behind a flag from week four.

Phase 04 · Operate

The default after launch is a retained engagement. We tune the system as it learns what the actual usage patterns look like. We expand scope when an adjacent opportunity surfaces. We run a monthly business review with a single slide: what did this system save, save you from, or earn, in dollars or hours.

Most clients work with us on a retainer precisely because the operational layer is where the long-term value lives. A system that's well-built but unmaintained quietly slides from useful to harmful in about four months.


How engagements are structured

Three shapes, depending on where you are:

  • Project. Discovery → Design → Deploy. Fixed scope, fixed milestones, written hand-off. Six to fourteen weeks.
  • Retainer. Operate & Optimize on a system we built, or one you built that needs adult supervision. Monthly business reviews.
  • Hybrid. Most of our work. Project to ship, retainer to operate. Same team across both.

First 30 / 60 / 90 days

  • Day 1–30 · Discovery embed. Diagnosis delivered. Design started.
  • Day 31–60 · System spec'd and design signed off. Build underway. First eval suite running.
  • Day 61–90 · System in production behind a flag. First monthly review framework in place. Operational handoff drafted.
Next step

Start with a diagnosis, not a demo.

A two-week discovery embed produces the only honest place to start: a written map of where AI can actually move your business.