HOW AI EXTRACTS.
When an AI engine processes a page for citation, it scans for content that directly answers the user's query. The scan is biased toward content near the top: above-the-fold paragraphs, opening sentences of <article> blocks, summary blocks. Content buried 2,000 words deep is rarely surfaced even when it is the better answer.
The pattern that wins extraction is the inverted pyramid: state the answer first, then provide context, then nuance. Journalism has used this for a century; AI extraction has revived it as a structural advantage.
THE PATTERN.
Three parts, in order:
- 1. Question or premise: 1-2 sentences naming the problem the page addresses. Either as a heading or as the first paragraph.
- 2. Direct answer: 1-3 sentences giving the literal answer. Under 60 words. Citable in isolation.
- 3. Context: the rest of the article. Caveats, examples, deeper reasoning, alternative views. Supports the answer; does not bury it.
EXAMPLE.
Anti-pattern (narrative open):
"In 2024, OpenAI began restructuring its crawler infrastructure. The change took place gradually over several months and reflected a broader trend in the AI industry toward separating training and search functions. Meanwhile, the SEO community was slow to react..."
Pattern (answer-first):
"OpenAI now operates three distinct crawler bots: GPTBot for training, OAI-SearchBot for the ChatGPT search index, and ChatGPT-User for on-demand fetches. Each has a different purpose and different correct robots.txt policy. Below is the breakdown."
Both versions can be the opening of the same article. The first is more readable as long-form prose. The second is dramatically more extractable - the AI engine reads the first three sentences and has the answer. Use the second.
WHERE IT DOES NOT APPLY.
The pattern is for informational and reference content. It is not for:
- Narrative storytelling - case studies, founder stories, profiles. Narrative arc requires withheld answers.
- Persuasion essays - opinion pieces where the structure of the argument is the value.
- Mystery / discovery genres - investigation pieces where the reveal is the payoff.
- Marketing-tone homepage copy - hero copy serves a different purpose (emotional pull) than informational content.
THE BOTTOM LINE.
Inverted-pyramid is the highest-leverage content pattern for AI extraction. Every research / how-to / reference article should default to it. Every long-form piece should have at least an answer-first summary block at the top, even if the body is narrative. Cost to implement: zero. Payoff: durable extraction lift across every AI surface.