We talk to companies about custom AI almost every day. And we tell maybe three out of five of them: "You don't need us. Use the ChatGPT API directly, or buy an off-the-shelf automation tool, and save yourself six months and half a million dollars."

That honesty costs us deals in the moment. It gains us credibility over time. Because when we're right about the build-vs-buy call, those companies remember. And when custom development is actually necessary, they call us back.

The build-vs-buy decision for AI is not obvious. The tools are so capable now that off-the-shelf solutions can handle what would have required a custom system two years ago. But if you've got data that competitors don't have, or workflows too specific for generic tools to match, custom is the only answer. The trick is knowing which camp you're in without wasting six months finding out.

The case for off-the-shelf: be honest about what you need.

ChatGPT, Claude, and their APIs are phenomenally powerful. For most use cases, they're enough. If you're automating customer support responses, routing emails, classifying documents, or generating copy, an off-the-shelf LLM with some prompt engineering handles it. You get a working system in days, not months.

The cost advantage is real and brutal. Custom development costs $200k–$500k and takes three to six months. An off-the-shelf tool costs $500/month and ships in a week. The maintenance burden is zero. Someone at OpenAI is keeping the model updated. You're not managing infrastructure. You're not paying for fine-tuning. You're not hiring AI engineers.

Off-the-shelf is also the smart call for your first AI project. You learn the shape of the problem. You see where the tool breaks. You build internal conviction about what custom actually buys you. Then, when you're ready for custom, you know exactly what it needs to do.

Off-the-shelf fails in two scenarios: (1) you have proprietary data that gives you a real edge, and generic models can't learn from it, or (2) your workflow is so specific that a generic tool would require months of prompt engineering to work at all - in which case you're building custom anyway, just worse.

The case for custom: data advantage and workflow specificity.

Custom AI makes sense when two things are true simultaneously: you have data competitors don't have, and you have a workflow so specific that off-the-shelf tools force you to distort your process to fit their constraints.

The data advantage is the cleaner case. If you have ten years of customer interaction logs, product usage data, or domain-specific training examples that competitors don't have access to, that data is leverage. A custom fine-tuned model trained on your data will outperform a generic model by a measurable margin. That margin compounds: better decisions feed better training data, which feeds a better model. That's a real moat.

Sentinel, our own product (Arjun leads the analytics layer), exists because of this principle. We built custom AI on top of Shopify usage data because the behavior patterns in e-commerce are learnable and no generic model would ever see them. That's custom development justified. To make the cost comparison concrete: at 100 Shopify stores with 90 days of daily KPI metrics, the entire dataset is approximately 54,000 rows - around 5MB total. The right database choice costs between $0 (SQLite on disk, Phase 1) and $5-10 per month (Phase 2 with scheduling). The LLM call per report runs roughly $0.003 on Claude Sonnet for structured output. Off-the-shelf analytics tools in this category typically charge per store per month with no LLM reasoning layer. The "tool, not mirror" distinction is what justifies custom here: a generic dashboard reflects what the merchant already knows; a custom system surfaces what they cannot see and names what it cost them. That is the ITP framing - the tool does the work for the operator instead of showing the operator data about themselves. If the output could have been produced by the merchant looking at their own Shopify dashboard, the custom development is not justified. If it could not have been, it usually is.

Workflow specificity is harder to evaluate. The question is: if you used an off-the-shelf tool, how much would you have to change your process to fit the tool's constraints? If the answer is "barely at all," buy the tool. If the answer is "fundamentally," then custom makes sense. But be honest. Most teams overestimate how special their workflow is.

Custom is also the answer for compliance and control requirements. If you're in regulated industries (finance, healthcare, legal), you may need to run your model on your own infrastructure, own your training data completely, and audit every decision. Off-the-shelf becomes much harder when compliance drives the design.

The five questions that resolve the decision.

Ask these honestly. Your answer determines build vs. buy.

1. Do you have data that competitors don't?

Not data you could get. Data you have now. If the answer is no, or "not really," off-the-shelf is fine.

2. Would your current workflow need to change significantly to use an off-the-shelf tool?

If you can bolt the tool onto what you're doing today, buy. If you'd need to rebuild your process, consider custom.

3. Is accuracy improvement worth $300k to you?

Custom can be 5–15% more accurate than off-the-shelf on your specific problem, because it's trained on your data. Is that marginal gain worth the cost and time? For most use cases, no. For some, yes.

4. How much compliance burden are you under?

Heavy compliance requirements push you toward custom. Light? Off-the-shelf usually works.

5. What happens if you're wrong about off-the-shelf?

If you start with off-the-shelf and realize it's not good enough in three months, migration to custom is painful but doable. You've learned the problem. Starting custom without that learning is riskier.

The hybrid reality: custom + off-the-shelf together.

Most mature production deployments use both. You might use the OpenAI API for general-purpose summarization, but fine-tune a custom model for your domain-specific decision-making. You might use off-the-shelf automation for customer support, but build custom classification for your rarest, highest-stakes cases.

This hybrid approach is underrated. It lets you ship fast, learn what custom actually needs to solve, and invest custom effort where it compounds. You're not betting the whole system on one decision. You're testing, and scaling custom only where it matters.

When custom development is actually the answer.

If you've answered yes to most of the five questions above, custom is right. Our AI development practice exists to handle exactly this case: you have a specific, high-stakes problem, proprietary data that gives you leverage, and workflows that off-the-shelf tools can't match. We run fourteen-day sprints to production. No strategy decks. No vendor dances. Code that runs.

The AI integration playbook walks through the full process if you're building custom. But before you commit to that path, be honest about whether you actually need custom, or whether off-the-shelf with some engineering is the smarter call.

The right answer is not always "build custom." Sometimes it's "use ChatGPT and get back to shipping."

The build-vs-buy decision is not about hubris. It's about capital efficiency. Custom AI is powerful when it solves a problem that off-the-shelf can't touch. It's a waste of money and time when off-the-shelf would do. The hardest part is being honest about which camp you're in.

If you've worked through the five questions and you're still on the fence, that's usually a signal to start with off-the-shelf, learn the shape of the problem, and revisit custom in six months. The time you spend learning is not lost. It's the foundation for custom development that actually ships.