How AI assembles
its answers
A customer types "best solicitor near me" into ChatGPT and gets a complete answer in seconds. Three firms are named, reasons given, sources cited. That answer looks simple, but the process behind it involves four distinct steps: understanding the question, retrieving sources from the web, weighing and combining information, and generating a coherent response. Each AI platform handles these steps differently, which is why ChatGPT, Gemini and Perplexity often recommend different businesses for the same question. This guide explains exactly how that process works and what it means for your business.
54%
of UK adults now use AI tools
75%
of UK adults have read AI-generated summaries
4
steps from question to AI answer
<1%
of ChatGPT citations are consistent across queries
From question to answer: the four-step process
When you ask an AI search engine a question, the system runs through four steps in seconds. Each step determines which businesses get mentioned and which sources get cited. Understanding this process is the first step to improving your AI visibility.
Step 1: Understanding the question
AI does not match keywords like Google does. It interprets meaning. When someone types "good plumber in Bristol who does emergencies," AI recognises this as a local service query with urgency. It identifies the trade (plumbing), the location (Bristol), the qualifier (emergency callouts) and the implied need (someone with availability and good reviews).
Context matters too. If earlier in the same conversation the user mentioned they rent out properties, AI factors that in. It might prioritise plumbers experienced with landlord certificates and multi-property contracts. This contextual understanding is fundamentally different from how traditional search engines work, and it changes which businesses appear.
Step 2: Retrieving sources
Based on the interpreted question, AI retrieves relevant sources. On platforms with real-time search capabilities (ChatGPT Browse, Perplexity, Gemini), the system searches the web, fetches pages, reads them and selects relevant passages. This is the "Retrieval" part of RAG (Retrieval Augmented Generation), the technology that powers most AI search engines. AI does not read your entire website. It extracts passages. It pulls the sections most relevant to the question and feeds them to the language model.
Step 3: Weighing and combining
AI combines information from multiple sources. If four sources say Firm X is a reliable solicitor in Manchester and one source says Firm Y is better, AI weighs not just the number of mentions but the authority of each source. A mention in the Law Society Gazette carries more weight than a forum post. A Google Business profile with 300 reviews matters more than a listing on an obscure directory.
This weighing process is where consistency across platforms becomes critical. If your business appears consistently on Trustpilot, Google Business, industry directories and your own website, AI has multiple confirmation points. If your information is fragmented or contradictory, AI is less confident about recommending you. You can read more about this in our guide on how AI recommends businesses.
Step 4: Generating the response
Based on the combined information, AI generates a coherent answer word by word. Each word is chosen based on probability, informed by both the retrieved sources and the model's training data. The response includes the selected businesses, an explanation of why they are recommended and references to the sources used. This is not a copy-paste operation. AI synthesises information from multiple sources into a new, original answer each time.
Retrieval Augmented Generation: the engine behind AI answers
RAG stands for Retrieval Augmented Generation and is the technology that powers most AI search engines. The principle is straightforward: instead of relying solely on what the model learned during training, it retrieves current information from the web for every question and incorporates that into its answer.
Without RAG, ChatGPT could only answer based on its training data, which is always outdated. With RAG, it can pull in current information: recent reviews, current prices, new businesses that opened last month. This is the reason AI answers in 2026 are far more current and specific than they were two years ago.
For UK businesses, this has a direct implication. Your online presence is not just about Google rankings any more. Every page on your website, every Trustpilot review, every mention in a trade directory is potentially retrievable by AI. The question is whether AI can find it, understand it and use it.
How the retrieval process works
The retrieval process works like this: AI sends a search query to the web, receives a list of potentially relevant pages, reads those pages and extracts the most relevant passages. Those passages are offered as context to the language model, which processes them into its answer. The model effectively "reads" a selection of the internet before responding. The quality of this retrieval step directly determines the quality of the answer.
The selection of which pages to retrieve is not random. AI uses signals similar to search engine rankings: domain authority, page structure, freshness and relevance to the query. But there are differences. AI also considers whether a page contains quotable passages, whether the information is specific enough to cite and whether multiple sources confirm the same facts.
Research shows that 40% of sources cited in Google AI Overviews come from positions 11 to 20 in traditional search results. This means a page ranked 15th on Google can still appear in an AI-generated answer. Content quality and citeability matter more than your exact Google position. A well-structured page at position 12 gets cited whilst a vague page at position 3 gets skipped.
AI does not read your entire page. It extracts passages. Put your most important information at the top, under clear headings, in short paragraphs of 40 to 60 words that make sense on their own.
This has direct implications for how you structure your content. Your page is not read and understood as a whole. The system extracts passages. If your most important information is buried in a long paragraph halfway down the page, there is less chance it gets picked up than if it sits prominently at the top under a clear heading. Content structure determines whether AI finds the right information about your business.
How each platform assembles answers differently
ChatGPT
Combines training data with real-time Browse results. Weighs existing knowledge heavily and supplements with current sources. Generates detailed, narrative answers. Cites sources selectively. Averages 3.86 citations per response. Holds 60.4% market share among AI search tools.
Approach: knowledge first, real-time supplement
Gemini
Direct access to Google's search index and Knowledge Graph. Pulls sources from the same index as Google Search. Integrates Google Maps and Google Business data directly into answers. Strong on local queries. Grew 146% year-on-year in the UK.
Approach: Google ecosystem as foundation
Perplexity
Purely RAG-based: searches the web for every question. Always cites sources with numbered references in the text. Most transparent about which sources are used. Averages 7.42 citations per response, nearly double ChatGPT's rate. Grew 100% year-on-year in the UK.
Approach: real-time search, always cite
Google AI Overviews
Selects from existing Google search results. 40% of citations come from positions 11 to 20. Summarising in nature. Less inclined towards specific business recommendations. Triggers an 83% zero-click rate when it appears. Averages 6 to 8 citations.
Approach: summarise the Google index
Is your content being picked up by AI?
VestVale automatically monitors whether ChatGPT, Gemini, Claude and Google AI cite your business. See which questions lead to citations and which do not.
Which sources AI cites most often
Not all sources are equal in the eyes of AI. Research shows clear patterns in which types of content get cited most frequently. Reddit accounts for 46% of all AI citations. YouTube follows at 32%. These are surprising numbers for many marketers who focus their efforts on traditional websites and blog posts.
The reason for Reddit's dominance is that AI platforms place high value on authentic, human experiences. Reddit posts contain personal experiences, opinions and recommendations that AI uses as social proof. When ten Reddit users recommend a particular estate agent in Leeds, AI treats that as genuine feedback rather than marketing copy.
For UK businesses, this matters because your customers are already talking about you on these platforms. If someone asks about a plumber in Sheffield on a local subreddit and three people recommend your business, that thread becomes part of AI's source material. Equally, negative mentions carry weight. A complaint thread about slow service becomes context that AI factors into its recommendations.
YouTube as a hidden citation source
YouTube is frequently cited because it contains rich, detailed information. Tutorials, reviews and analysis videos often go deeper than blog articles. AI can read YouTube transcripts and process that information into answers. A detailed product review on YouTube can lead to an AI citation even if your website is not found. For tradespeople and service businesses, a simple video showing your work and explaining your process can become a citable source that AI picks up.
A striking trend in 2026 is the growing importance of Trustpilot and Checkatrade in UK AI results. These platforms carry particular weight for local and service businesses. When ChatGPT recommends a roofer in Birmingham, it frequently cites Trustpilot scores and Checkatrade ratings as supporting evidence. Having a strong presence on these platforms directly increases your chance of being mentioned.
Industry-specific directories also play an outsized role. For solicitors, the Law Society's Find a Solicitor tool is a key source. For accountants, ICAEW and ACCA directories matter. For builders and tradespeople, the Federation of Master Builders and Gas Safe Register carry weight. AI treats these authoritative directories as trusted sources because they involve verification and professional standards.
40% of AI citations come from Google positions 11 to 20. Content quality and citeability matter more than your exact Google ranking.
For businesses, this means you should not focus solely on your website. Being active on Reddit (in relevant subreddits, without spamming), creating YouTube content, maintaining strong Trustpilot profiles and appearing in trade directories all increase your chances of AI citation. The sources AI cites are not the same sources Google ranks highest. Diversifying your online presence gives AI more material to work with when it assembles an answer about your industry.
Why the same question produces different answers
Ask ChatGPT "best accountant in Manchester" twice, five minutes apart, and you will likely get different answers. This is not a bug. It is fundamental to how language models work. Fewer than 1% of ChatGPT citations remain consistent when the same question is asked repeatedly. The UK shows 60% citation volatility in AI results, higher than France at 42%.
Several factors cause this variation. First, the retrieval step may return different web pages each time, especially if the search index has been updated. Second, the language model has a "temperature" setting that introduces controlled randomness into word selection. This prevents AI from giving identical, robotic answers but means businesses cannot guarantee a permanent recommendation.
What this means for monitoring
Because answers vary between sessions, checking your AI visibility once tells you very little. You need to monitor consistently over time. A single query where your business appears does not mean you always appear. Equally, one session without a mention does not mean you are never recommended. VestVale tracks your AI visibility across multiple queries and platforms over time, giving you a reliable picture rather than a snapshot. This is why monitoring local AI visibility matters more than ever.
Cross-platform differences add another layer. ChatGPT, Gemini and Perplexity use different source indexes, different ranking algorithms and different language models. A business that appears consistently in ChatGPT might be absent from Gemini because Gemini pulls from Google's Knowledge Graph whilst ChatGPT uses its own web crawling. For a detailed comparison, see our guide on why AI results differ between platforms.
The practical takeaway is that AI visibility is not binary. Your business is not simply "visible" or "invisible." It exists on a spectrum, appearing in some percentage of relevant queries, on some platforms, for some variations of the question. A strong AI visibility strategy aims to increase that percentage across all platforms, not to guarantee a single result.
This is different from traditional SEO, where you can rank at position 3 for a keyword and stay there for weeks. In AI search, your position fluctuates with every query. The businesses that appear most consistently are those with the strongest, most diversified online presence: strong website content, consistent reviews, multiple directory listings and mentions across platforms. Consistency of information across these sources is what gives AI confidence to recommend you.
Writing content that AI picks up and cites
Write in quotable blocks
AI extracts passages, not entire pages. Write paragraphs of 40 to 60 words that make sense on their own. Each paragraph should make a concrete point that AI can use without needing context from the rest of the page. Think of each paragraph as a potential citation.
Lead with the answer
Skip long introductions. Start each section with a direct answer to the question posed in the heading. AI favours content that leads with facts, not context. Use the inverted pyramid: most important information first, supporting detail after.
Use specific headings
H2 headings phrased as questions work best. "What does a solicitor charge in London?" is better than "Our fees." AI matches user questions to headings on your page. The closer your heading matches the question, the more likely your content gets retrieved.
Include concrete facts
Numbers, percentages, prices and specific data make your content quotable. "Our service is affordable" is not citable. "Our bookkeeping starts at £79 per month for sole traders" is. Specificity is what AI needs to build its answers.
Add FAQ sections
FAQs contain the exact format AI needs: a question followed by a direct answer. When a user asks AI the same question, your answer is likely to be picked up. Structure your FAQs with real questions your customers ask, not marketing-driven ones.
Add schema markup
JSON-LD structured data gives AI a machine-readable summary of your page. FAQ schema, HowTo schema and LocalBusiness schema measurably increase your citeability. This is especially important for UK service businesses competing for local AI visibility. More on this in how AI reads websites.
How context and intent shape the answer
A fundamental difference between AI answers and Google search results is conversational context. With Google, every search is standalone. You type a keyword, you get results, done. With AI chat platforms like ChatGPT, there is conversation history. Earlier messages in the conversation influence the answer to your next question.
If someone first says "I run a limited company in Edinburgh with five employees" and then asks "Which accountant do you recommend?", AI factors in that context. The answer will include accountants who specialise in limited companies, preferably with experience handling payroll and HMRC compliance in Scotland. Without that context, AI would give a more generic list.
Specificity wins over generality
This has implications for how you build your online profile. The more specifically you describe who you serve and what you specialise in, the better AI can match you to specific user questions. An accountant whose website states "Specialist in limited companies with 1 to 10 employees, Edinburgh and the Lothians" will be recommended more often for exactly that audience than one who writes "We help businesses with their accounts."
The role of intent is growing as AI models become better at understanding what users actually want. According to Ofcom, 75% of UK adults have encountered AI-generated search summaries. As familiarity grows, so does trust, and users increasingly rely on AI to select the right sources and deliver the right answer. That trust makes AI's selection process even more consequential for businesses.
Question wording matters enormously. "Cheap solicitor London" produces a different answer from "Experienced solicitor London." AI interprets "cheap" as price-focused and highlights firms advertising fixed fees or free consultations. "Experienced" leads to firms with decades of practice and high review scores. Same profession, same city, but different firms in the answer.
Reviews as context for AI matching
For businesses, this means you should optimise not just for your core terms but also for the words your customers use to describe you. If your clients call you "thorough" and "professional" in Trustpilot reviews, those are the words AI picks up. If clients describe you as "quick" and "affordable," you get recommended for different questions. Your review language literally becomes the context AI uses to match you to user queries. This is why managing your online reputation is now directly tied to your AI visibility.
Frequently asked questions
Does AI read my entire website or just fragments?
Fragments. AI extracts relevant passages from your pages, not the full text. This is why content structure is critical: clear headings, short paragraphs that stand alone and concrete facts at the top of each section. The easier AI can find a relevant passage, the greater your chance of being cited in an answer.
Does my content need to be recent to get cited?
It depends on the type of content. For fact-sensitive queries (prices, statistics, current developments), AI favours recent sources. For definitions, processes and principles, age matters less. Generally, regularly updated content performs better than static pages. Updating key pages quarterly is a good practice for UK businesses.
Why does AI cite Reddit and YouTube so frequently?
Reddit contains authentic, human experiences that AI uses as social proof. YouTube contains detailed reviews and tutorials with more depth than many blog articles. AI values these sources because they offer genuine experiences and detailed information that is often missing from standard business websites.
Can I influence how AI processes my content?
Yes. Write in quotable blocks of 40 to 60 words, lead with the answer, use specific headings, include concrete facts and add schema markup. These changes make it easier for AI to find, understand and cite your content. They do not guarantee a mention, but they significantly improve your odds.
Why do I get different AI answers each time I ask?
AI answers vary because of three factors: the retrieval step may return different pages, the language model uses controlled randomness in word selection, and the conversation context differs. Fewer than 1% of ChatGPT citations remain consistent across repeated queries. This is why ongoing monitoring matters more than one-off checks.
What is the difference between RAG and training data?
Training data is what the AI model learned during its training phase, which has a knowledge cutoff date. RAG (Retrieval Augmented Generation) adds real-time web search on top of that. When ChatGPT uses Browse mode, it combines both: its trained knowledge plus current web results. This is why keeping your website and profiles updated matters for AI visibility.
Is your content being cited by AI?
VestVale automatically monitors whether ChatGPT, Gemini, Claude and Google AI cite your business. See which content gets picked up and which does not.
From £19.95/mo ex. VAT. Cancel monthly.