Articles
Why Your Website Alone Won't Get You Cited by AI
If you have been focused solely on optimising your own website to appear in AI-generated answers, you may be missing the bigger picture. New research reveals that 70% of AI citations come not from a brand's own pages, but from third-party sources across the web. It is a finding that should fundamentally shift how businesses think about their digital presence in an age of AI-driven search.
If you have been focused solely on optimising your own website to appear in AI-generated answers, you may be missing the bigger picture. New research reveals that 70% of AI citations come not from a brand's own pages, but from third-party sources across the web. It is a finding that should fundamentally shift how businesses think about their digital presence in an age of AI-driven search.
According to research from Princeton and Georgia Tech (Aggarwal et al.), AI models draw the vast majority of their citations from off-site content. That means forums, independent reviews, news coverage, trade publications, academic sources, and marketplace listings are far more influential in shaping AI responses than your product pages or blog posts alone.
The split breaks down like this: roughly 30% of cited content comes from on-site sources such as product pages, technical documentation, blog guides, and FAQ sections. The remaining 70% comes from off-site content including user-generated discussions on platforms like Reddit, independent review sites, media and analyst mentions, government or academic sources, and marketplace listings.
This is not a reason to neglect your own website. Well-structured on-site content still plays a role. But it does mean that a strategy focused entirely on owned channels will leave significant ground uncovered.
The reason behind this split comes down to how AI systems build trust in the information they surface. There are three key factors at play.
Third-party Corroboration Carries Significant Weight
AI models synthesise answers from multiple independent publications, and brands that appear consistently across several unrelated sources are seen as more credible. A single well-written blog post on your own domain simply cannot replicate the trust signal that comes from being discussed, reviewed, and referenced across a range of external platforms.
Freshness Another Critical Factor
Around half of all content cited by AI is less than 13 weeks old. This means that a one-off PR push or a content piece published a year ago is unlikely to sustain visibility in AI responses over time. Consistent, ongoing coverage and fresh mentions across the web matter considerably more than a static archive of owned content.
Structured Formatting Influences Which Pages Get Cited
Research suggests that pages broken into scannable sections of roughly 120 to 180 words between headings are 70% more likely to be cited by models such as ChatGPT. Clear structure signals to AI systems that content is organised, readable, and authoritative.
The practical implication is clear: digital PR is no longer just a brand awareness tool. It is a direct lever for AI visibility. Earning coverage in reputable trade publications, securing independent reviews, generating genuine discussion on forums, and building a presence across trusted third-party platforms all contribute to the kind of multi-source corroboration that AI models respond to.
Brands that invest in earned media, encourage authentic user reviews, and maintain a steady cadence of external mentions will be far better positioned in AI-generated responses than those relying on owned content alone. This is a meaningful shift in how to think about content strategy, and the research suggests the gap between brands that adapt and those that do not will only widen as AI-driven discovery becomes the norm.
The 70/30 rule is a useful reminder that in the age of AI, your reputation across the wider web matters just as much as what you publish on your own site.
Ready to build your AI visibility beyond your website? Get in touch with our team today.


