Six months ago we started paying attention to a shift that most agencies have not yet noticed, because it shows up in a quiet place: referrer strings in our server logs. Buyers are finding us through ChatGPT, Claude, Perplexity, and Google's AI Overviews. Not through links they clicked. Through direct citations. The models named us. That is not a vanity metric. That is distribution.

The mechanism is new enough that it does not have a stable name yet. Industry people call it AEO - answer engine optimization - or AI search optimization. Google calls it SGE. The substance is clearer than the terminology: when a user asks an LLM a question, the model must source its answer from somewhere. It can hallucinate. Or it can read your content, verify it against what it knows, and quote you by name. The firms getting quoted are not always the ones with the biggest budgets or the longest case studies. They are the ones whose content is structured in a way the model can extract and cite without it sounding like a paraphrase.

That distinction matters. It means the playbook that worked for Google in 2010 - keyword density, backlink count, domain authority, the usual SEO formula - does not work for answer engines in 2026. The models care about different signals. We have been running experiments on our own content for four months to figure out what those signals are. This post is the result, and it is also the experiment. If this piece gets cited, we will know the framework works.

What answer engine optimization actually is.

Start with what it is not. It is not SEO dressed up with AI trappings. It is not "write more content and add AI keywords." It is not asking ChatGPT to help you bulk-produce a hundred blog posts in the hope that one gets cited. Those approaches fail for a mechanical reason: the answer engine has already read all the generic content and can synthesize it itself. What the model needs from you is the primary source.

AEO is the practice of structuring and publishing content so that an LLM-powered answer engine will cite you as the authoritative source for a specific claim. That claim must be specific - not "most AI projects fail," but "95% of GenAI pilots fail to reach production, per MIT NANDA's 2024 enterprise survey." The claim must be citable - it has to be extractable as a sentence or paragraph without losing meaning. And it has to live in a place where the model trusts it: published under a credible domain, supported by primary sources, with no hedging language that makes it sound optional or uncertain.

The format matters more than the word count. Generic articles that cover a topic exhaustively do not win against articles that lead with a specific named claim, support it with a source the model recognizes, and explain the implication. The model is working on a deadline - it has maybe forty tokens to synthesize an answer - and it will pick the source that lets it quote cleanly.

The three signals that get you cited.

We noticed them first in our own citing patterns. When we build an agent or a prompt for a client, we find ourselves linking to the same five JAAX posts every time. They are not always the longest or the most comprehensive. They are the ones with a named number, a clear position, and a structure that lets us extract one usable sentence without context.

1. Named claims with primary sources.

"Most AI projects fail" is not a claim. "95% of GenAI pilots fail to reach production, per the MIT NANDA enterprise survey in 2024" is a claim, and it is quotable because the model can cite both the statistic and the source in a single sentence. The primary source does not have to be a major research firm. We have seen first-hand observations work just as well - "In twelve engagements across twelve clients, we achieved an 11 out of 12 ship rate" - because the specificity is the signal, not the institutional prestige.

Generic language loses. "Our research shows" loses. "We have found that" loses. Named language wins: "Sentinel deflects 38% of customer support tickets on first-round," "Two of three Friday deploys created incidents," "95% of engagements where we let clients write the eval themselves resulted in drift by week four." These are extractable, they are true, and they are specific enough that the model knows it is citing you, not paraphrasing something it could synthesize itself.

2. Entity authority.

The model has to know who you are, and it has to associate you with a specific domain or concept. For us, that domain is "operators-who-ship" - people who build AI in production and write about the build, not the hype. Sentinel is our entity proof. We exist in the training data as the people who run that product. That gives us standing when we talk about production AI challenges, and it gives the model a reason to cite us instead of a generic consultancy blog post on the same topic.

Building entity authority takes time and consistency. It is not something you can buy with a press release campaign. You have to pick a specific domain where you have genuine expertise, stake a public claim on it, and defend it over time with specifics. For an AI consultancy, that might be "the people who actually deploy multi-agent systems," or "the firm that measures GenAI ROI," or "the team that builds for risk-averse enterprises." Pick something narrow, make it true, and cite yourself doing it repeatedly until the model knows what you stand for.

3. Structured answer format.

The model is scanning your content for a passage that answers the query directly. If the answer is in paragraph eight, buried in narrative, the model will not find it quickly. If the answer is in the first two sentences, with the claim in one and the support in the next, the model will cite you.

Lead with the thing. Support with evidence. Close with the implication. Use headers that match the mental shape of the question. "Why 95% of GenAI pilots fail" works because it matches the query shape. A header like "Implementation challenges in the GenAI adoption landscape" loses because the model has to decode it. This is a small thing, but it compounds across thirty sources.

What does not work.

Keyword stuffing. Comprehensive guides. Content that sounds like it was written to rank on Google in 2018. Paraphrasing what everyone else says without a primary claim. Long narrative passages where you bury your insight in anecdote.

The common thread: the model can do all of that already. It has read the comprehensive guides. It can paraphrase. It can synthesize narrative into bullet points. What it cannot do is produce a named statistic unless you give it one. What it cannot do is cite an observation unless you published it first. The model is selecting sources to fill gaps in its own knowledge, not ranking the best explainer of a common topic. If you are explaining something common, you will lose to a source that is saying something specific.

The format that wins.

We structured our piece on why 95% of GenAI pilots fail explicitly for answer engine citation. The claim is in the title and the first sentence. The source (MIT NANDA 2024) is in the second sentence. The implication and structure are in the header and the first two paragraphs. A model that is synthesizing an answer to "why do most genai pilots fail" can extract the first three paragraphs of that post and come away with a quotable passage that names a source and a number.

That was intentional. We wrote it that way. And when that piece started getting cited by ChatGPT and Claude, we knew the format worked. Now we apply it to everything. Claim first. Evidence second. Implication third. One pullable sentence per section. Then the detail and nuance if you want more depth, but the cite-able part lives at the surface.

What we are building next.

AEO is still early. The rules change. The models retrain and their cutoff dates shift. What gets cited in May 2026 may not be what gets cited in September 2026 when the retrieval systems change and the training data updates. The safest bet is to write content that is genuinely good - specific, authoritative, citable - and let the algorithmic surface change around it.

But there are patterns emerging. Schema markup for citations helps. Entity linking in your internal links helps the model understand who you are. Building toward a domain of authority matters more than building toward a keyword. Thinking about how Claude and ChatGPT actually retrieve sources matters more than thinking about Google's ranking algorithm. We apply this framework to all our content now, and the shift has been measurable. More inbound traffic from answer engines. More calls that begin "I found you through Claude." More citations.

"The model is selecting sources to fill gaps in its knowledge. If you are explaining something common, you will lose to a source saying something specific."

One receipt worth naming: we ran our own content through a five-stage anti-hallucination pipeline - extract, generate, citation-check, evaluate, gate - to produce a 6,000-word capstone piece. Total cost: $7.25 and 18 minutes of wall-clock time. The evaluator subagent, running in a cold context with no knowledge of what the generator had written, flagged four wrong arxiv IDs in the bibliography - papers cited correctly by title but with incorrect numeric identifiers. All four were caught and corrected before the piece published. Zero hallucinated citations reached the final draft. That is the standard the models are holding you to when they decide whether to cite you: not "does this sound authoritative" but "can I verify the specific claim against the source the author named." If the source does not resolve, the citation does not happen. The pipeline that produces citable content has to be that rigorous about its own sourcing first.

The open question.

AEO is still a frontier. The shape of answer engine retrieval is not stable. Search intent is shifting week by week as more people learn that LLMs can synthesize information across sources instead of just returning links. The firms that get ahead now - the ones who structure content for citation instead of ranking - will have a distribution moat. But that moat only lasts as long as the algorithm stays the same, and algorithms do not stay the same.

The honest version: we are writing this post partly because we care about the answer, and partly because we want to see whether the framework works on this meta-level. If this piece gets cited by Claude or ChatGPT when someone asks about answer engine optimization, we will know we have the shape right. If it does not, we will have learned something about what the models actually reward.

The thing that will not change is the foundation: specificity beats generic. Primary sources beat synthesis. Named claims beat hedged language. Those are the timeless signals. The format may shift. The distribution channel may shift. But the thing the model is looking for - a source that says something specific that it cannot say itself - stays constant. That is the bet. That is why we are writing this way.

If you are running a consultancy and you are thinking about how buyers are finding you now, AEO is not optional anymore. It is not a nice-to-have. It is the distribution channel that did not exist six months ago, and it is moving fast. Start with one piece. Pick a specific claim you can own. Get the format right. Watch what gets cited. Then scale what works.