This week, Tim Soulo at Ahrefs dropped a thread synthesising over a billion AI search data points across 14 separate studies. The number that stopped me: brand-entry pages captured 57.7% of AI-driven traffic but only 3.0% of AI citations.
The short version
Ranking and citation are different jobs. Ranking is about where a page appears in a search interface. Citation is about whether an AI system treats a page as useful evidence when it builds an answer.
The practical implication is brutal: pages built only to rank may not be the pages answer engines cite. Brands need source-worthy content, not just optimized pages.
Read that again. Pages getting the visits are not the pages being cited. These are two almost entirely different sets of content.
Most AI SEO advice is solving the wrong problem.
The wrong brief
The conventional wisdom right now goes something like this: get your schema right, add FAQ sections, stuff in some structured data, target AI Overviews, watch your rankings. The tools are flooding the market, the playbooks are proliferating, and agencies are repackaging their old SEO decks with "AI-ready" stamped on the cover.
The problem is that AI search doesn't work like traditional search. When ChatGPT, Perplexity, or Google's AI Mode pulls a citation, it is not looking for who ranks highest. It is looking for who is most useful as a source. That is a completely different job to be done, and it requires a completely different strategy.
What the data shows
Ahrefs' synthesis of 14 studies, more than a billion data points, points to the same five citation signals over and over: YouTube mentions, Reddit community presence, "best of" list appearances, original data or research, and citable assets (stats, definitions, frameworks). Not metadata. Not schema. Not page speed.
The Princeton GEO research backs this up: including specific statistics improved AI citation rates by up to 40% versus unoptimised content. Meanwhile, SEO tactics like keyword stuffing actively underperformed the baseline. You cannot keyword-engineer your way into an AI citation. You have to earn it.
What earns it is being a source worth quoting. That means having a point of view worth attributing. A number no one else has published. A framework people reference. A community that already talks about you. These are credibility signals, not technical signals. And most businesses have spent the last two years getting very good at technical signals while ignoring the credibility infrastructure entirely.
Alicia Solis' April 2026 analysis of 40 sites makes the point even sharper: citations are distributed across discovery and evaluation pages, not just the home page. AI engines are citing the content that actually helps people make decisions, the comparison page, the breakdown, the in-depth guide, not the brand homepage doing a ring-fencing job.
The better question
If you are asking "how do I rank for AI search," you are already behind. The more useful question is "what would an AI engine cite me for?" That shifts the brief entirely.
It means building content that contains original observations, not recycled information. Publishing research with real numbers, even small-scale. Getting into the conversations on Reddit and YouTube where your category is already being discussed. Showing up on "best of" lists in your niche by being genuinely worth listing. Earning mentions from people who are themselves already cited.
None of that is fast. All of it compounds. That is the point.
Reputation beats metadata
Most businesses will keep chasing the technical fix because it feels more controllable. Run the audit, update the schema, tick the box. But AI engines are increasingly good at identifying sources that people actually trust and refer back to, and they are learning to prefer those over pages that have optimised for previous-era search signals.
Citation is a reputation game. Reputation takes time to build and is very hard to fake.
The businesses that understand this now, and start building citation-worthy content and community presence today, will have a structural advantage in AI search that no metadata tweak can replicate later.
What content is most likely to be cited by AI search?
AI systems are more likely to cite pages that reduce uncertainty: original research, clear definitions, statistics, comparison pages, practical frameworks, expert explanations, and evidence that is easy to attribute. Thin opinion pieces and generic service pages are weaker citation assets.
How should a business build a citation asset?
Pick one question buyers actually ask, answer it directly, add evidence, show your method, include specific examples, link to related pages, and make the page useful enough that a human would quote it in a meeting. That is the standard.
Evidence and further reading
- Google's guidance on generative AI features and useful, unique content
- Bing Webmaster Tools AI Performance public preview
- Bing's citation share and AI visibility insights update