AI citations
How to Get Cited by ChatGPT, Perplexity, and AI Overviews
Each of these systems decides what to cite differently, which means there's no single trick that works everywhere. This guide walks through how each one actually sources its answers, what makes a page or mention worth citing, and a concrete plan you can start on this week.
In this guide
How ChatGPT, Perplexity, and AI Overviews actually pick sources
People tend to talk about "getting cited by AI" as one problem. It isn't. The three systems founders care about most build their answers in different ways, and the difference matters more than most advice acknowledges.
ChatGPT
ChatGPT's answers come from a blend of two things: what was baked into the model during training, and, when it decides a query needs current information, a live web search it runs on the fly. For anything that sounds like "what's the best tool for X" or "compare A vs. B," it usually leans on browsing, because that kind of question benefits from current pricing, reviews, and product details. When it browses, it's pulling from indexed, crawlable pages the same way a search engine would, then summarizing across whatever it retrieves. If your product barely existed during the model's training window, browsing is effectively your only path in.
Perplexity
Perplexity is built around live retrieval by default. Almost every answer runs a real-time search, ranks the results, and then writes a response with visible numbered citations pointing back to the pages it used. Because it's retrieving fresh each time, recency carries more weight here than it does for a model answering from memory. A page published last month that directly addresses the query can outrank an older, more authoritative page that's gone stale. This is also the engine where you can literally watch which sources it picked, which makes it the easiest one to audit.
Google AI Overviews
AI Overviews draws from Google's existing search index, which means everything you already know about ranking still applies: crawlability, page quality signals, and topical relevance. It tends to favor pages that already rank well organically and that answer the query directly near the top of the page, rather than burying the answer under throat-clearing. It's less about being novel and more about being one of the clearest, best-supported answers already sitting in the index.
The practical takeaway: ChatGPT rewards being findable and well-summarized when it searches, Perplexity rewards being fresh and specific, and AI Overviews rewards being clearly the best existing answer to a well-defined question. You don't need three separate strategies, but you do need content that holds up under all three sets of criteria at once.
What makes something worth citing
Regardless of which engine is doing the summarizing, the underlying model is making a judgment call: is this page a trustworthy, specific answer to the question being asked? A few things push a source over that line.
- Clarity of claim. A sentence that states a fact plainly is easier to lift and cite than a paragraph that hedges, qualifies, and buries the point three clauses in.
- Specificity. "Wally helps founders find distribution channels" is forgettable. "Wally drafts Reddit and LinkedIn replies for founders to review and approve before posting" gives the model something concrete to repeat.
- Credibility signals. Author names, dates, real detail, and a domain that isn't obviously a content farm all factor in. Anonymous, undated pages get treated with more suspicion.
- Corroboration elsewhere. If three independent sites describe your product the same way, that description starts to look like consensus rather than marketing copy. Models pick up on that pattern.
- Freshness, especially for Perplexity. A page updated recently, or one that includes a visible date, has an edge in any engine that's retrieving live. Stale pages with no update signal get deprioritized over time even if the content is technically still true.
None of this is about gaming a ranking algorithm. It's closer to the standard a careful editor would apply: is this claim clear, specific, backed up, and current? If you write with that bar in mind, you're already most of the way there.
Your pages vs. what other people say about you
It's tempting to assume that getting cited is mostly a matter of publishing more pages on your own site. That's part of it, but it's not the biggest part. Language models are trained on and retrieve from a huge span of the web, and a paragraph you wrote about your own product carries less weight, on its own, than the same claim showing up independently in a Reddit thread, a comparison post on someone else's blog, or a review site.
The reason is straightforward: your own marketing copy is expected to be favorable to you. A third party saying the same thing has no obvious incentive to flatter you, so it reads as more credible, and it's exactly the kind of corroboration described above.
This is why community discussion matters so much for this specific goal. A Reddit thread where someone asks "what's a good tool for finding where to post as an indie hacker" and a handful of replies mention your product by name, with some context about what it actually does, is worth more for citation purposes than another landing page saying the same thing about yourself. Comparison posts, "best of" roundups, and review sites work the same way — they're framed as independent judgment, which is precisely what makes them citable.
The right mix is both: owned pages that state your value proposition clearly and are easy to crawl and summarize, plus a steady presence in the places where people already discuss and compare tools in your category. Owned content gives engines a clean, quotable source. Third-party mentions give them the corroboration that makes citing you feel safe.
A step-by-step plan for this week
You don't need a quarter-long content plan to start moving on this. Here's a sequence a solo founder can realistically work through in a week.
- Write down the five questions you want to be the answer to. Be specific: not "project management software" but "what's a good tool for indie hackers to find where their target users hang out online." Vague questions produce vague content that nothing can cite well.
- Audit what currently shows up when you ask those questions. Run them through ChatGPT and Perplexity yourself and note who gets mentioned, which pages get cited, and what language they use to describe the category.
- Write one clear, dated, specific page per question. State the answer plainly near the top. Don't make the model dig for it. Include real specifics — how something works, what it costs, what it doesn't do — rather than general claims.
- Go find the conversations already happening. Search Reddit, Hacker News, and Quora for people asking versions of your five questions. Reply with something genuinely useful, and only mention your product where it's relevant, not as a pitch.
- Get mentioned somewhere you don't control. Reach out about a comparison post, submit to a relevant directory, or answer questions on a review site. Independent corroboration is what turns a claim into a citable fact.
- Keep dates visible and pages current. Add a visible "updated" date and actually revisit the page when something changes. This matters more for Perplexity than the others, but it never hurts.
- Repeat weekly rather than doing it once. Citations compound. One good page rarely moves the needle; a steady drumbeat of clear pages and real mentions does.
This is also the kind of work that's easy to know about and hard to keep doing consistently — finding the right threads, drafting a reply that's actually useful, tracking which comparison posts are worth a pitch. That's the specific gap Wally is built for: it researches where these conversations are happening, drafts replies and posts for Reddit, LinkedIn, X, Hacker News, and similar channels, and queues them for your approval so the weekly cadence doesn't depend on you remembering to do it.
How to tell if it's working
The most direct check is also the simplest: periodically ask ChatGPT and Perplexity the exact questions you identified in step one, and see whether you show up, and how. Perplexity makes this easy since it shows its sources directly — if your page or a mention of you appears in that list, you have a clear signal. ChatGPT is less transparent about sourcing, but you can still track whether it mentions you by name and how accurately it describes what you do.
Beyond manual checks, watch for referral traffic from chat.openai.com, perplexity.ai, and similar domains in your analytics. It's usually a small number relative to organic search, but a nonzero and growing number is a real signal that citations are translating into actual visits. Pair that with basic brand-mention tracking — a saved search or alert for your product name — so you notice when a new thread or comparison post mentions you, since that's often the leading indicator before a citation shows up.
Be patient about the feedback loop. Indexing, retrieval, and model updates all lag behind publication, sometimes by weeks. Don't judge one page's performance after two days. Judge the trend after a couple of months of consistent, specific, dated content plus real engagement in the conversations where your category gets discussed.