How to Optimise Specifically for ChatGPT (Citations, SearchGPT)
ChatGPT cites web content via SearchGPT (its web-search mode) and via browsing in some Pro features. Optimising for ChatGPT differs from optimising for Perplexity or Claude: different crawlers (OAI-SearchBot, GPTBot), different content preferences, and a different citation surface. This guide covers ChatGPT-specific optimisation. Pair it with the AEO guide and the Perplexity version.
Step-by-step: How to optimise for ChatGPT citation
- Allow OAI-SearchBot and GPTBot in robots.txt. SearchGPT uses the 'OAI-SearchBot' user-agent for web search; GPTBot is OpenAI's training crawler. Allow both if you want ChatGPT visibility: 'User-agent: OAI-SearchBot\nAllow: /' and 'User-agent: GPTBot\nAllow: /'. Block neither unless you have specific copyright concerns.
- Verify content is server-rendered. ChatGPT/SearchGPT crawlers don't execute JavaScript reliably. Disable JS in DevTools → reload your key pages → content must be visible. If not, fix rendering before expecting ChatGPT citations.
- Structure content for direct-answer extraction. Front-load value: first 100 words answer the question. Use clear H2 questions, paragraph answers. Add FAQ sections with FAQPage schema. ChatGPT preferentially extracts content matching question-answer pairs.
- Add comprehensive structured data. Article + FAQPage + Organization schema on content pages. Product + AggregateRating on commerce. BreadcrumbList everywhere. ChatGPT/SearchGPT weight schema heavily when deciding what to cite.
- Build entity recognition signals. ChatGPT cites entities it recognises: companies and authors with Wikipedia entries and consistent profiles across LinkedIn, Twitter, and GitHub. Establish your brand and authors as recognisable entities. Useful anchors: a Crunchbase profile, a Wikidata entry, and an About Us page with structured Organization schema.
- Track ChatGPT citations. Search 'site:yoursite.com' together with representative queries inside ChatGPT (with SearchGPT enabled). Note which pages get cited. Tools: Profound, Otterly, HubSpot AI Search tracking, ranklytics.ai. Sample manually every week; use tooling for at-scale tracking.
- Iterate on cited content. When ChatGPT cites a page, examine what made it citable: the angle, the structure, the schema. Replicate that pattern in adjacent content. Conversely, when high-value pages aren't cited, restructure them to look more like the cited ones.
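The server-rendering step above can also be checked without a browser. The following is a minimal Python sketch, assuming you already have the raw HTML your server returns (before any JavaScript runs) and a short list of phrases that must be readable on the page. The names VisibleTextParser and missing_phrases are hypothetical helpers written for this guide. The check only approximates what a non-JavaScript crawler sees; it does not reproduce how OpenAI's crawlers actually parse pages.

```python
from html.parser import HTMLParser


class VisibleTextParser(HTMLParser):
    """Collects text that sits outside <script>, <style> and <template> elements."""

    SKIPPED = {"script", "style", "template"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element.
        if self._skip_depth == 0:
            self.chunks.append(data)


def missing_phrases(raw_html, phrases):
    """Return the phrases that do not appear in the HTML's visible text."""
    parser = VisibleTextParser()
    parser.feed(raw_html)
    parser.close()
    # Collapse whitespace and compare case-insensitively.
    text = " ".join(" ".join(parser.chunks).split()).lower()
    return [p for p in phrases if p.lower() not in text]
```

To use it, fetch a key page with any HTTP client that does not execute JavaScript, pass the response body as raw_html, and list a few phrases from the main content. An empty result suggests the content is present in the server response; any phrase returned is probably being injected client-side.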
Frequently Asked Questions
Does ChatGPT have its own user-agent?
Yes, several. GPTBot is the training crawler (disallow it in robots.txt to opt out of future training). OAI-SearchBot handles live search for SearchGPT mode. ChatGPT-User appears when a user with browsing-enabled Pro asks ChatGPT to visit your site. Each has distinct robots.txt behaviour. Allow OAI-SearchBot and GPTBot for ChatGPT visibility.
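To see which of these agents actually reach your site, you can classify the User-Agent values in your access logs. The sketch below is a minimal Python example that matches on the three token names given above; classify_openai_agent and count_openai_hits are hypothetical helper names, and the role labels are just this guide's shorthand. User-Agent headers can be spoofed, so treat the counts as indicative rather than verified.

```python
from collections import Counter

# Token names as listed in this guide, mapped to a short role label.
OPENAI_AGENT_TOKENS = {
    "GPTBot": "training crawler",
    "OAI-SearchBot": "live search",
    "ChatGPT-User": "user-initiated visit",
}


def classify_openai_agent(user_agent):
    """Return (token, role) if the User-Agent names an OpenAI agent, else None."""
    ua = user_agent.lower()
    for token, role in OPENAI_AGENT_TOKENS.items():
        if token.lower() in ua:
            return token, role
    return None


def count_openai_hits(user_agents):
    """Count hits per OpenAI agent token across an iterable of User-Agent strings."""
    counts = Counter()
    for ua in user_agents:
        match = classify_openai_agent(ua)
        if match is not None:
            counts[match[0]] += 1
    return counts
```

Feed count_openai_hits the User-Agent column from a day of logs. If OAI-SearchBot never appears, check robots.txt and any edge-level bot rules before changing content.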
How is SearchGPT different from regular ChatGPT for citations?
SearchGPT (rolled into ChatGPT for Plus users over 2024-2025) explicitly queries the web and cites sources inline with [1], [2] markers. Regular ChatGPT without SearchGPT relies on training data, so your content may be used but is rarely cited by name. SearchGPT is the citation surface.
Why isn't ChatGPT citing my Shopify product pages?
Common causes: 1) Shopify's edge layer occasionally rate-limits OAI-SearchBot (rare, but it happens). 2) Product schema is incomplete (missing brand, aggregateRating). 3) Reviews are thin (<3 verified reviews). 4) Descriptions are generic and match many other stores. Audit each; fix whichever gap is most pronounced for your products.
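Causes 2 and 3 can be audited mechanically. Below is a rough Python sketch that inspects one Product JSON-LD object and reports a missing brand, a missing aggregateRating, or a review count under three. It assumes the JSON-LD is a single top-level object with "@type": "Product" and that the count lives in reviewCount or ratingCount; audit_product_schema is a hypothetical helper name, and the sample data in the usage note is invented for illustration.

```python
import json


def audit_product_schema(json_ld_text, min_reviews=3):
    """List schema gaps for a single Product JSON-LD object (assumed top-level)."""
    data = json.loads(json_ld_text)
    if data.get("@type") != "Product":
        return ["@type is not Product"]

    problems = []
    if not data.get("brand"):
        problems.append("missing brand")

    rating = data.get("aggregateRating")
    if not rating:
        problems.append("missing aggregateRating")
    else:
        # schema.org allows either reviewCount or ratingCount; values may be strings.
        count = int(rating.get("reviewCount") or rating.get("ratingCount") or 0)
        if count < min_reviews:
            problems.append(f"only {count} reviews (fewer than {min_reviews})")
    return problems
```

For example, a product with no brand and an aggregateRating whose reviewCount is "2" would come back as two problems: the missing brand and the thin review count. An empty list means none of the gaps this sketch checks for were found.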
Does ChatGPT prefer fresh content?
Yes. SearchGPT shows a preference for recently updated, authoritative sources. Updating evergreen guides quarterly improves citation likelihood. Adding new sections, updating data points, and refreshing the schema dateModified field all signal currency.
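Refreshing dateModified is easy to script as part of a content update. The following Python sketch rewrites that one field in an existing Article JSON-LD object and leaves everything else, including datePublished, untouched. refresh_date_modified is a hypothetical helper name, and the sketch assumes the JSON-LD is a single top-level object. It is meant to run when the page content has actually been revised.

```python
import json
from datetime import date


def refresh_date_modified(json_ld_text, modified_on=None):
    """Return the JSON-LD with dateModified set to modified_on (default: today)."""
    data = json.loads(json_ld_text)
    # ISO 8601 date, e.g. 2025-01-15, which schema.org accepts for dateModified.
    data["dateModified"] = (modified_on or date.today()).isoformat()
    return json.dumps(data, indent=2)
```

Pass an explicit date when backfilling a known revision date; otherwise the current date is used.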
Can I block GPTBot but allow SearchGPT?
Yes. In robots.txt, 'User-agent: GPTBot\nDisallow: /' blocks the training crawler; 'User-agent: OAI-SearchBot\nAllow: /' allows live search. Some publishers (NYT, etc.) make this split: block training, allow live citations. Most stores benefit from allowing both.
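Before deploying a split policy like this, it is worth confirming that the rules parse the way you expect. Here is a minimal Python sketch using the standard library's urllib.robotparser. The ROBOTS_TXT text mirrors the rules quoted above, example.com is a placeholder domain, and crawler_access is a hypothetical helper name. Python's parser is only one interpretation of robots.txt, so agreement here does not guarantee that OpenAI's crawlers read the file identically.

```python
from urllib.robotparser import RobotFileParser

# The split policy from the answer above: block training, allow live search.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: OAI-SearchBot
Allow: /
"""


def crawler_access(robots_txt, url, agents=("GPTBot", "OAI-SearchBot")):
    """Map each agent name to whether robots_txt lets it fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}
```

With the policy above, crawler_access(ROBOTS_TXT, "https://example.com/products/mug") reports GPTBot as blocked and OAI-SearchBot as allowed. Swap in your real robots.txt text and a few representative URLs to catch rules that accidentally shadow each other.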