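AI agents (GPTBot, PerplexityBot, ClaudeBot, CCBot) need to crawl your WordPress site before they can learn from it. Google-Extended is a robots.txt token that controls whether content fetched by Google's existing crawlers may be used for its AI products; it is not a separate crawler. WordPress core does not block these agents, but security plugins, host-level bot protection and plugin-added robots.txt rules often block or rate-limit several of them. This guide covers WordPress-specific configuration to grant the agents access while keeping malicious bots out. Combine it with the agent compat guide and the robots.txt fixes guide.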
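Visit yoursite.com/robots.txt. Look for Disallow rules under a User-agent line that names an AI agent, or a blanket Disallow: / under User-agent: *. The virtual robots.txt that WordPress generates by default is permissive (it only disallows /wp-admin/), but SEO plugins (Yoast, Rank Math, AIOSEO) may add restrictions, and a physical robots.txt file in the site root overrides the virtual one.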
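The manual check above can also be scripted. The following is a minimal sketch using Python's standard urllib.robotparser; the SAMPLE_ROBOTS_TXT content and the blocked_agents helper are hypothetical and stand in for whatever your own site serves.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens named in this guide.
AI_AGENTS = ["GPTBot", "ClaudeBot", "PerplexityBot", "Google-Extended", "CCBot"]

# Hypothetical robots.txt content, standing in for yoursite.com/robots.txt.
SAMPLE_ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php
"""


def blocked_agents(robots_txt: str, url: str, agents=AI_AGENTS) -> list[str]:
    """Return the agents that robots_txt forbids from fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [agent for agent in agents if not parser.can_fetch(agent, url)]


print(blocked_agents(SAMPLE_ROBOTS_TXT, "https://yoursite.com/"))
# prints ['GPTBot']
```

To test the live file instead, paste the text of yoursite.com/robots.txt into the first argument. Note that this parser applies the first matching rule in a group, which may differ from how an individual crawler resolves conflicting Allow and Disallow lines.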
Decide which AI agents to allow. A common permissive setup allows GPTBot, ClaudeBot, PerplexityBot, Google-Extended and CCBot. If you have copyright concerns, block the agents that collect content only for model training. Record the decision as comments in robots.txt.
Open the robots.txt editor in your SEO plugin: Yoast SEO → Tools → File editor; Rank Math → General Settings → Edit robots.txt; AIOSEO → Tools → Robots.txt Editor. Add an explicit Allow rule for each agent you welcome.
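The text to paste into one of those editors can be generated rather than typed by hand. This is a rough sketch, not a plugin feature: render_ai_rules is a hypothetical helper, and the agent lists and note passed to it are illustrative choices only.

```python
def render_ai_rules(allowed: list[str], blocked: list[str], note: str) -> str:
    """Build robots.txt groups: a comment recording the decision, then one group per agent."""
    lines = [f"# AI agent policy: {note}"]
    for agent in allowed:
        lines += ["", f"User-agent: {agent}", "Allow: /"]
    for agent in blocked:
        lines += ["", f"User-agent: {agent}", "Disallow: /"]
    return "\n".join(lines) + "\n"


# Hypothetical policy, for illustration only.
print(render_ai_rules(
    allowed=["GPTBot", "ClaudeBot", "PerplexityBot"],
    blocked=["CCBot"],
    note="hypothetical policy for illustration",
))
```

Each agent gets its own group so that a later change to one agent does not touch the others, and the leading comment keeps the reasoning next to the rules.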
Check the bot-blocking rules in Wordfence, Sucuri and iThemes Security (now Solid Security). Their rules for 'aggressive crawlers' can catch AI agents, so add the AI user-agents you decided to allow to each plugin's allowlist explicitly.
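Cloudflare Bot Fight Mode, Wordfence rate limiting and many managed WordPress hosts (WP Engine, Kinsta) can challenge or rate-limit crawlers other than Googlebot. Allowlist the AI agents' published IP ranges or user-agents in the relevant dashboard. Prefer IP ranges where the vendor publishes them, because a user-agent string is easy to spoof.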
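Before allowlisting an address seen in your logs, it helps to confirm that it really falls inside a vendor's published ranges. The sketch below uses Python's standard ipaddress module. The ranges shown are reserved documentation blocks and "ExampleBot" is a placeholder, so both are assumptions to be replaced with the lists each AI vendor actually publishes.

```python
import ipaddress

# Hypothetical CIDR blocks taken from reserved documentation ranges.
# Replace them with the ranges each AI vendor actually publishes.
EXAMPLE_AGENT_RANGES = {
    "ExampleBot": ["192.0.2.0/24", "2001:db8::/32"],
}


def agent_for_ip(ip: str, ranges=EXAMPLE_AGENT_RANGES):
    """Return the agent name whose ranges contain ip, or None if no range matches."""
    address = ipaddress.ip_address(ip)
    for agent, cidrs in ranges.items():
        for cidr in cidrs:
            network = ipaddress.ip_network(cidr)
            # Compare only like with like: IPv4 against IPv4, IPv6 against IPv6.
            if address.version == network.version and address in network:
                return agent
    return None


print(agent_for_ip("192.0.2.17"))    # prints ExampleBot
print(agent_for_ip("198.51.100.5"))  # prints None
```

A request that claims an AI user-agent but resolves to None here is a candidate for blocking rather than allowlisting.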
AI agents parse clean HTML5 semantic markup (article, section, nav, main) more reliably than generic containers. Confirm that your theme outputs these elements, and avoid the div-only output of older page builders.
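One quick way to confirm what a theme outputs is to count the semantic elements in a saved copy of a page. This is a minimal sketch with Python's standard html.parser; SemanticTagCounter, count_semantic_tags and the SAMPLE markup are hypothetical names and data used only for illustration.

```python
from html.parser import HTMLParser

SEMANTIC_TAGS = {"article", "section", "nav", "main"}


class SemanticTagCounter(HTMLParser):
    """Count opening tags for the semantic elements above, plus div for comparison."""

    def __init__(self):
        super().__init__()
        self.counts = {tag: 0 for tag in SEMANTIC_TAGS | {"div"}}

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this handler.
        if tag in self.counts:
            self.counts[tag] += 1


def count_semantic_tags(html: str) -> dict[str, int]:
    """Return a mapping of tag name to number of occurrences in html."""
    counter = SemanticTagCounter()
    counter.feed(html)
    counter.close()
    return counter.counts


# Hypothetical page markup.
SAMPLE = "<body><nav></nav><main><article><section><div>Post</div></section></article></main></body>"
print(count_semantic_tags(SAMPLE))
```

A page where div is the only non-zero count is the div-only case described above.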
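Override the user agent in Chrome DevTools (More tools → Network conditions, then clear 'Use browser default'). Set the user agent to 'Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.0; +https://openai.com/gptbot' and reload your site. A 200 response means user-agent rules let the agent through; a 403, 429 or challenge page points to a plugin, CDN or host rule. Check the server logs to confirm that the request reached the origin.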
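The same test can be run from a script, which is easier to repeat after each configuration change. The sketch below uses Python's standard urllib and the user-agent string quoted above; build_request and fetch_status are hypothetical helper names, and yoursite.com is a placeholder.

```python
import urllib.error
import urllib.request

# User-agent string quoted in this guide.
GPTBOT_UA = (
    "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); "
    "compatible; GPTBot/1.0; +https://openai.com/gptbot"
)


def build_request(url: str, user_agent: str = GPTBOT_UA) -> urllib.request.Request:
    """Create a GET request that identifies itself with user_agent."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_status(url: str, user_agent: str = GPTBOT_UA) -> int:
    """Return the HTTP status code the site gives to user_agent."""
    try:
        with urllib.request.urlopen(build_request(url, user_agent), timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # Blocked or rate-limited responses (403, 429, ...) arrive as HTTPError.
        return error.code


# Example call (performs a real network request):
# fetch_status("https://yoursite.com/")
```

Because the request comes from your own IP address, this only exercises user-agent rules; protections that verify the agent's source IP will treat it as a spoofed request.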
Verify GPTBot, PerplexityBot and ClaudeBot can crawl your site.