When you ask an AI assistant a question, it rarely runs one search and reads one page. Modern answer engines run a pipeline: they decide whether to search at all, break your question into many smaller queries, fetch and re-rank candidate pages, pack the best passages into the model's context, then write an answer and attach citations. Understanding each stage explains the single most important fact for any brand: being named in the answer and being cited as a source are two separate outcomes, and you can win one without the other. This is the documented machinery behind the "black box," with the inferred industry practice flagged as such.
Stage one: the engine decides whether to search
Before any retrieval happens, the model makes a routing decision: answer from its training data, or go fetch live information. In practice, engines lean toward searching for time-sensitive, comparative, or factual queries (prices, news, reviews) and lean toward internal knowledge for stable, timeless questions. This decision is situational and varies by platform: Perplexity grounds nearly every response in live sources, while ChatGPT searches more selectively. The takeaway is that if the engine never searches, your content cannot be cited no matter how good it is.
Stage two: fan-out turns one question into many
Rather than searching your exact words, the engine decomposes the prompt into multiple related sub-queries and runs them in parallel, a technique Google publicly calls "query fan-out" in its AI Mode announcement, powered by a custom version of Gemini. Different sub-queries can be routed to different sources: the open web, a knowledge graph, shopping or local data, and structured feeds. The practical consequence is that you are not competing for one keyword anymore. You are competing across a spread of sub-questions the user never typed.
Stage three: fetch, re-rank, and pack the context
Each sub-query returns a set of candidate pages. Most engines are believed to re-rank those candidates by relevance and quality and keep only a small subset, discarding most of what it retrieved. The surviving passages, not whole pages, are then packed into the model's context window as the evidence it will read. Industry analyses consistently find that the large majority of retrieved pages are never used, so simply being retrievable is necessary but far from sufficient.
Stage four: generate the answer, then attach citations
The model writes the answer from the packed passages, and citation can happen one of two ways. Some systems generate the answer and its sources together so the text is tied to evidence as it is written; others generate the answer first and attach supporting links afterward, an approach documented in research on systems like RARR. The second method, sometimes called post-hoc attribution, can produce a citation that supports a claim without being the true origin of the wording. This is why a cited link does not always mean that page is where the assistant "learned" the fact.
The key insight: named is not the same as cited
Two distinct things can happen to your brand in an AI answer. You can be mentioned, where the model names you in the recommendation itself, or you can be cited, where your URL appears as a linked source. These are decided at different stages by different signals, so they do not move together. Industry analyses of AI visibility suggest relatively few brands consistently earn both, meaning your research can inform an answer that then recommends a competitor by name.
- AI answers come from a pipeline, not a single search: decide-to-search, fan-out into sub-queries, fetch, re-rank, pack, generate, then attribute. Optimize for the whole chain, not one keyword.
- Fan-out means you are judged across many sub-questions the user never typed. Google documents this as "query fan-out" in AI Mode, run on a custom Gemini model.
- Most retrieved pages are never used. Being findable gets you into the candidate pool; surviving the re-rank and making it into the packed context is the harder bar.
- Citations are not always proof of origin. Some engines attach sources after writing the answer, so a cited link may support a claim without being where the wording came from.
- Being NAMED in the answer and being CITED as a source are separate outcomes governed by different signals. Track both, because you can win one and lose the other, and the named brand usually captures the buyer.
