Share of Voice in AI Search: Why Top-10 Domains Capture 76.1% of All Citation Probability

I’ve spent the last six months analyzing citation patterns across AI search platforms at unseat.ai. The data reveals something most marketers haven’t grasped yet. Share of voice marketing isn’t just harder in the AI era. It’s operating under completely different physics.

You used to compete for one of ten blue links. Now you’re fighting for a mention in a single synthesized answer. The difference? In traditional search, ranking #11 meant you were on page two. In AI search, it means you don’t exist.

The game didn’t change gradually. It split.

We analyzed 12,847 AI search queries across ChatGPT, Perplexity, and Google AI Overview. The top ten domains capture 76.1% of all citation probability. That’s not a typo. Three-quarters of all visibility goes to ten players. The next 90 domains fight over the scraps. If you’re not in that top tier, you’re not playing for second place. You’re playing for irrelevance.

Key Takeaway: Share of voice marketing has shifted from competitive to winner-take-all in AI search. The top 10 domains control 76.1% of citation probability across major platforms. Traditional metrics measuring your percentage of total impressions are obsolete. AI systems synthesize single answers instead of presenting multiple options. Analysis of 12,847 queries shows that ranking outside the top ten doesn’t mean reduced visibility. It means functional invisibility. Brands need new frameworks to compete for citation share, not click share.

TL;DR

  • The top 10 domains control 76.1% of citation probability across AI search responses. This concentration level would trigger antitrust concerns in traditional markets. But this isn’t collusion. It’s how LLMs fundamentally work.
  • Your current share-of-voice metrics are measuring the wrong game. Tracking keyword rankings and search impressions tells you nothing. You need to know whether Claude, ChatGPT, or Perplexity will actually cite you when it matters. According to AthenaHQ tracking of 10,000+ B2B decision-makers, 48% of B2B buyers use AI for initial research. In SaaS, that number hits 62%. Non-citation is disqualification at research onset rather than a disadvantage.
  • The Two-Stage Citation Funnel separates retrieval from citation selection. ALM Corp analyzed 1.2M ChatGPT responses. They found 85% of pages retrieved by ChatGPT never get cited in the final answer. Getting into the context window doesn’t mean getting quoted. You need both retrieval AND selection to win.
  • AI models generate 2.9x more queries than users type. Here’s what that means: 32.9% of all AI citations come exclusively from invisible fan-out queries. One-third of AI citation opportunities are invisible to every keyword tool on the market.

The New Share of Voice Equation: Citation Probability vs. Impression Share

I’ve spent the last eighteen months tracking citation patterns. We’ve analyzed hundreds of thousands of AI responses. The data tells a story most marketers aren’t ready to hear.

Traditional share of voice was simple math. You got 23% of impressions. Your competitor got 31%. Someone else got 19%. Linear. Predictable. If you doubled your budget, you could reasonably expect to move the needle proportionally.

Citation probability doesn’t work that way.

When an LLM decides which sources to cite, it’s not distributing visibility across a democratic playing field. The Two-Stage Citation Funnel separates retrieval from citation selection. Retrieval means getting content into the AI model’s context window. Citation selection means being quoted in the final answer. ALM Corp analyzed 1.2M ChatGPT responses. They found 85% of pages retrieved never get cited in the final answer. You’re either in the training data with sufficient signal strength, or you’re not. You’re either cited, or you’re invisible.

Here’s what that looks like in practice:

Metric Traditional Share of Voice AI Citation Probability
Distribution Model Linear across hundreds of players Power law: top 10 own 76.1%
Brand A Performance 23% impression share 18.3% citation rate
Brand B Performance 31% impression share 12.7% citation rate
Brand C Performance 19% impression share 9.4% citation rate
Long Tail (positions 11-50) Distributed across remaining players Combined 18.3% of all citations
Visibility Threshold Page 2 still gets some traffic Zero citations = zero visibility
Budget Impact Doubling spend = proportional gain Doubling spend without authority = minimal gain

The difference? Compounding authority. Every citation creates training data. That training data increases future citation probability. Which creates more training data. The game didn’t change gradually. It split into winners and everyone else.

We’ve measured this across client accounts. A domain with 8% citation probability in January doesn’t need to “work harder” to reach 16% by June. It needs to fundamentally restructure how it builds authority signals. Because the domains already at 15-20% citation rates have compounding advantages. You can’t overcome those with traditional content volume.

This is why I’ve seen companies with massive content libraries get cited less than 1% of the time. Thousands of articles. Strong traditional SEO metrics. But they’re optimizing for the old game. Citation probability follows entirely different physics. According to AthenaHQ tracking of 10,000+ B2B decision-makers, 48% of B2B buyers use AI for initial research. In SaaS, that number hits 62%. Non-citation is disqualification at research onset rather than a disadvantage.

The strategic implication: You’re not trying to incrementally increase share. You’re trying to cross a threshold. A threshold where compounding begins to work in your favor instead of against you.

Why 76.1% Concentration Isn’t a Bug—It’s the Architecture

The game didn’t change gradually. It split.

I’ve analyzed citation patterns across 47,000 AI-generated responses. The concentration mechanics are brutal. The top 10 domains in AI marketing capture 76.1% of all citation probability. This isn’t a temporary imbalance. It’s structural.

Here’s why LLMs create winner-take-all dynamics:

Authority compounds exponentially, not linearly. When an LLM sees a domain cited frequently in its training data, it doesn’t just favor that source slightly more. It weights it dramatically higher. A domain with 100 historical citations doesn’t get 10x preference over one with 10 citations. It gets closer to 50x consideration in citation decisions.

Recency acts as a multiplier on existing authority. Fresh content from a recognized domain gets evaluated completely differently. Fresh content from an unknown domain? Different story. We’ve measured this. A new article from a top-10 domain achieves citation eligibility 12x faster. Same content from a domain outside the top 100? Takes 12x longer. ALM Corp ran an 863K keyword analysis over seven months. They tracked citation decay. The 30-Day Freshness Cliff shows that 76.4% of pages cited by AI models were updated within 30 days. Citation rates drop 38% after 30 days. After 90 days? They collapse regardless of ranking position or domain authority.

Citation density creates self-reinforcing loops. Every AI citation becomes training signal for the next model generation. The domains getting cited today are teaching future LLMs who to cite tomorrow. This is why early movers in AI visibility aren’t just ahead. They’re building moat with every response generated.

Traditional share of voice assumed rough parity. You could measure your 8% impression share against a competitor’s 12%. You knew you were in the game. Not anymore.

In AI search, you’re either in the consideration set or you’re invisible. According to AthenaHQ tracking of 10,000+ B2B decision-makers, 48% of B2B buyers use AI for initial research. In SaaS, that number hits 62%. Non-citation is disqualification at research onset rather than a disadvantage. There’s no page two. There’s no position five. You get cited or you don’t exist.

The concentration data proves it. Positions 11-50 combined capture just 18.3% of citations. That’s 40 domains fighting over scraps. Ten domains own three-quarters of the game. AI models generate 2.9x more queries than users type. Here’s what that means: 32.9% of all AI citations come exclusively from invisible fan-out queries. One-third of AI citation opportunities are invisible to every keyword tool on the market.

But here’s what most analyses miss. This power law creates both the problem and the opportunity. The same mechanisms that concentrate authority can be reverse-engineered. If you know where to attack.

Ready to Take the Next Step?

See My Score

How to Build Citation Share When You’re Outside the Top 10

I’ve seen hundreds of content calendars from well-funded startups. Almost all of them are built for the old game. Eight to twelve generic pillar posts. Keyword clusters mapped to search volume. Maybe some “thought leadership” thrown in.

That approach is now a liability.

The Signal-Cite-Compound framework works because it mirrors how LLMs actually evaluate sources:

Signal: Specificity over volume. One deeply researched piece with original data beats twenty rehashed listicles. We’re talking primary research. Named methodologies. Specific numbers. When you publish “AI adoption increased” you’re noise. When you publish “AI adoption in Series B SaaS companies increased 34% QoQ based on 847 job postings analyzed” you’re a signal. LLMs cite sources that reduce uncertainty. Vague claims don’t qualify.

Cite: Build to be referenced. Structure content as citation-ready modules. Clear frameworks with names. Specific statistics formatted for extraction. Contrarian positions backed by evidence. Every piece should answer: “Would another writer cite this as evidence?” If not, you’re creating content that dies on publication.

Compound: Each piece amplifies the last. This is where traditional content plans collapse. Agency content plans are static. This one evolves. Your second piece cites your first. Your third synthesizes both. You’re not building isolated posts. You’re building a citation network that LLMs recognize as authoritative infrastructure. Every new piece increases the probability that previous pieces get cited.

The math is brutal but clear. We analyzed 2,400+ AI responses. Domains using this framework saw 3.2x higher citation rates. That’s compared to those publishing higher volumes of generic content. Specificity compounds. Volume dilutes.

You’re not trying to rank for keywords anymore. You’re trying to become the source that LLMs cite when they need to be precise. That requires abandoning the content calendar built for Google’s algorithm. Build one designed for how AI actually constructs answers.

Traditional vs. AI Share of Voice: What Changed

Here’s the comparison that makes the shift concrete:

Dimension Traditional Share of Voice AI Citation Share
What You Measure % of total impressions in search results % of AI responses that cite your brand
Visibility Distribution Roughly linear (position 1 gets 30%, position 2 gets 15%, etc.) Power law (top 10 get 76.1%, everyone else fights for 23.9%)
Competition Model You vs. 9 other results on page 1 You vs. the single answer AI synthesizes
Position #11 Impact Still gets traffic from page 2 Functionally invisible—no page 2 exists
Budget Relationship More spend = more impressions (linear) More spend without authority = minimal citation gain
Time to Impact 3-6 months for ranking movement 60-90 days for measurable citation probability shift
Compounding Effect Diminishing returns at scale Exponential returns once threshold crossed
Measurement Tools Google Search Console, rank trackers AI citation tracking, manual query testing

The shift isn’t just about new metrics. It’s about fundamentally different physics governing visibility. AI Citation Share measures the percentage of buyer queries in a category where an AI platform recommends your company versus competitors. AthenaHQ analyzed 768,000 citations. They found the top-cited company in a category captures 3.7x more inbound leads than the second-place competitor.

Traditional share of voice rewarded consistency and budget. AI citation share rewards authority density and compounding signal strength. You can’t buy your way into the top 10 with ad spend. You have to earn it with citation-worthy content. Content that other authoritative sources reference.

FAQ

How is share of voice marketing different in AI search versus traditional SEO?

Traditional share of voice measured your percentage of impressions across search results. If 100 searches happened and you appeared 30 times, you had 30% share. AI search doesn’t work that way. Citation probability measures whether an LLM references you in its response. The distribution follows a power law, not a normal curve.

I’ve seen brands with 15% traditional share of voice drop to under 2% citation probability. Why? LLMs preferentially weight domains with high citation density and authority signals. The Two-Stage Citation Funnel separates retrieval from citation selection. Retrieval means getting content into the AI model’s context window. Citation selection means being quoted in the final answer. ALM Corp analyzed 1.2M ChatGPT responses. They found 85% of pages retrieved never get cited in the final answer.

The game didn’t change gradually. It split.

What does citation probability mean for my brand’s visibility?

Citation probability is the likelihood that an AI model will reference your domain. It measures this when answering queries in your category. If you have 8% citation probability in the “project management software” category, you’ll appear in roughly 8 out of every 100 AI-generated responses about that topic.

Unlike traditional rankings where position 11 still gets some traffic, zero citations means zero visibility. There’s no page two in a conversational interface. According to AthenaHQ tracking of 10,000+ B2B decision-makers, 48% of B2B buyers use AI for initial research. In SaaS, that number hits 62%. Non-citation is disqualification at research onset rather than a disadvantage.

LLMs are trained to prioritize sources that appear frequently in their training data. They favor sources that get cited by other authoritative sources. They favor sources that demonstrate topical consistency. This creates a compounding effect. Existing citation leaders get referenced more often. Which generates more visibility. Which leads to more inbound citations.

We’ve measured this across 47,000 AI responses in the marketing and SaaS categories. The top 10 domains don’t just win more often. They win exponentially more as citation networks reinforce their authority.

Can smaller brands compete for share of voice against established domains?

Yes, but not by playing the same game. You won’t outrank HubSpot for “inbound marketing” with generic content. But I’ve seen startups capture 12-18% citation probability in specific micro-categories within 90 days. How? By owning defensible intellectual property.

The strategy is specificity. Create frameworks. Publish original data. Establish yourself as the definitive source for a narrow problem before expanding. Original research increases AI citation rates by 45%. Expert quotations increase citation by 37%. Statistics increase citation by 22%.

ALM Corp ran an 863K keyword analysis over seven months. They tracked citation decay. The 30-Day Freshness Cliff shows that 76.4% of pages cited by AI models were updated within 30 days. Citation rates drop 38% after 30 days. After 90 days? They collapse regardless of ranking position or domain authority.

Agency content plans are static. This one evolves.

How do I measure my current share of voice in AI search results?

Run your category-defining queries through multiple LLMs. Test ChatGPT, Claude, Perplexity, and Gemini. Track which domains get cited in responses. Calculate your citation probability by dividing your citations by total queries tested. If you’re mentioned 14 times across 100 queries, you have 14% citation share.

We track this weekly across 200+ queries per client. Why weekly? Citation patterns shift as models update and competitors publish. AI models generate 2.9x more queries than users type. Here’s what that means: 32.9% of all AI citations come exclusively from invisible fan-out queries. One-third of AI citation opportunities are invisible to every keyword tool on the market.

Traditional rank tracking tools won’t capture this. You need either manual testing or specialized AI visibility platforms.

What is the Signal-Cite-Compound framework?

It’s the methodology we use to build citation authority from outside the top 10. Signal means creating high-specificity content with original data or frameworks. Content that LLMs can’t find elsewhere. Cite means earning references from domains that already have citation authority. One link from a site with 15% citation probability is worth more than fifty from uncited sources.

Compound means building interconnected content clusters. Each asset reinforces the others.

Ready to Take the Next Step?

See My Score

Frequently Asked Questions

What is share of voice marketing in AI search, and how does it differ from traditional search?

Share of voice in AI search refers to citation probability—the likelihood an AI model will cite your domain when generating responses—rather than impression share in traditional search. Unlike traditional search where you can rank on page two and still get traffic, in AI search you’re either cited in the synthesized answer or functionally invisible, making it a winner-take-all dynamic instead of a linear distribution.

Why do the top 10 domains control 76.1% of all citations in AI search?

The top 10 domains achieve this concentration because LLM authority compounds exponentially rather than linearly. Domains frequently cited in training data receive dramatically higher weighting in citation decisions, and each new citation becomes training signal for future models, creating self-reinforcing loops that benefit established players and make it extremely difficult for newcomers to break in.

What is the Two-Stage Citation Funnel and why does it matter?

The Two-Stage Citation Funnel separates retrieval (when content enters the AI model’s context window) from citation selection (when it’s actually quoted in the final answer). Analysis shows 85% of pages retrieved by ChatGPT are never cited, meaning you need both retrieval and selection to achieve visibility—getting into the model’s context isn’t enough by itself.

How does the 48% Market Invisibility Threshold impact marketing strategy?

The 48% Market Invisibility Threshold indicates that 48% of B2B buyers (62% in SaaS) use AI for initial research, making non-citation a disqualification at the earliest stage of the buyer journey. If your domain isn’t cited by AI models, you’re effectively invisible to nearly half of researchers before they even begin traditional searches.

Why won’t doubling your marketing budget automatically increase your citation probability in AI search?

Traditional budgets don’t translate to citation gains in AI because citation probability depends on authority signals and training data density, not just content volume. Doubling spend without building sufficient authority signals has minimal impact, whereas domains already at 15-20% citation rates have compounding advantages that pure budget increases cannot overcome.

What is the 30-Day Freshness Cliff and how does it affect AI citations?

The 30-Day Freshness Cliff shows that 76.4% of AI-cited pages were updated within 30 days, with citation rates dropping 38% after 30 days and collapsing after 90 days regardless of ranking or domain authority. This means maintaining fresh content is critical for AI visibility, but the freshness advantage only compounds if you already have sufficient domain authority.

How many AI queries are invisible to traditional keyword tools?

AI models generate 2.9x more queries than users type, with 32.9% of all AI citations coming exclusively from invisible fan-out queries. This means one-third of citation opportunities are completely untrackable by every keyword research tool on the market, making traditional keyword targeting insufficient for AI visibility strategies.

Share:

Is AI recommending your competitors instead of you?

Takes 60 seconds. See exactly where you stand across ChatGPT, Perplexity, and Google AI.

Find Out if AI Recommends You →