AI-engine citation feels like a black box until you look closely. It is not. The flow is roughly three steps — retrieve, ground, rank — and each step runs on signals you can understand and mostly control. The specifics differ across ChatGPT, Perplexity, Google AI Overviews, Gemini, Claude, and Copilot, but the shape is shared, and once you know the shape, the PDP work worth prioritizing becomes obvious.
Retrieval: query to documents
When a buyer types a shopping query, the engine first decides which web documents to pull into a working set — a mix of keyword matching, embedding-based semantic search, and, in some engines, signals from an underlying search partner. Retrieval is fast, looks at many candidates, and typically keeps 20 to 200 pages for the next step. Some engines also expand the query first — query fan-out — which is why a PDP can be cited on a question the buyer never literally asked.
If your PDP is not retrieved, nothing downstream matters. Retrievability means a clean URL, crawlable content (no JavaScript-only rendering for the parts that matter), an accurate title and H1, and language that matches what buyers actually type.
Grounding: what makes a page citeable
From the working set, the engine grounds its answer — pairing every claim it wants to make ("this jacket retails for $239 and weighs 8 ounces") with a source page that asserts those facts. Structured data makes grounding trivial. A page that declares price, SKU, brand, material, and weight in Product JSON-LD is much cheaper to cite than one that buries the same facts in marketing prose. As of mid-2026 the engines vary in strictness: Perplexity is the strictest about grounding, ChatGPT the loosest, Gemini between.
Ranking: who makes the final list
Among grounded candidates, the engine picks what to cite. Observable behavior is consistent with a mix of: how directly the page addresses the query, source authority, recency, how well the page's entity matches the one the query resolves to, and how diverse the final list should be. The practical reading: your PDP competes against review pages and marketplace listings in a contest that favors directness, authority, and freshness.