Say Hello

THE CLOUDFLARE
AI BOT BLOCK.

Cloudflare started defaulting to AI-bot blocking for many onboarding flows in 2025. Across the 47-site research network, roughly 30% of sites had at least one AI bot silently blocked without the owner realising. This is detection plus fix.

Sites Affected
~30%
Default
Block
Detection
Minutes
Fix Time
Minutes

THE DEFAULT THAT BROKE A LOT OF SITES.

When Cloudflare added the "Block AI Crawlers" feature in 2025, they made it easy to enable during domain onboarding. Many sites - including ones with active SEO and content marketing strategies - silently opted in without their owners realising. The toggle is small, the wording is reassuring ("protect your content from AI training"), and the implication for AI search visibility is rarely flagged.

Across the 47-site research network plus a broader audit sample, roughly 3 in 10 sites had at least one major AI bot silently blocked at the Cloudflare layer. The owner would not see this in Google Search Console, would not see it in their analytics, and would only see it in raw Cloudflare logs - which most owners never look at.

Finding 01.
How to detect

THE 5-MINUTE DETECTION.

Run a curl with each major AI user-agent and check the response. From any terminal:

  • curl -I -A "GPTBot/1.0 (+https://openai.com/gptbot)" https://yoursite.com/
  • curl -I -A "ClaudeBot/1.0 (+https://www.anthropic.com/claudebot)" https://yoursite.com/
  • curl -I -A "PerplexityBot/1.0 (+https://www.perplexity.ai/perplexitybot)" https://yoursite.com/

WHAT THE RESPONSES MEAN.

HTTP 200: bot is allowed. Good.

HTTP 403: blocked. Almost certainly Cloudflare.

HTTP 429: rate-limited. Could be Cloudflare bot fight mode or an aggressive WAF rule.

Connection reset / timeout: Cloudflare bot fight mode at the network level.

Cloudflare challenge page (HTML with "Just a moment..."): bot fight mode JS challenge - the bot cannot solve it, so this is effectively a block.

Finding 02.
The four likely culprits

WHY THE BOT IS BLOCKED.

If the curl test shows blocking, one of these four is responsible:

  • Block AI crawlers toggle (Cloudflare dashboard -> Site -> Bots -> Settings). The most common cause. Disable to fix.
  • Bot fight mode (Bots -> Bot fight mode). Blocks anything that fails JS challenges. AI bots cannot solve them. Disable or set to "Super bot fight mode" with "Allow verified bots".
  • WAF rule blocking by user-agent (Security -> WAF -> Custom rules). Look for rules with http.user_agent contains "bot". These overcatch AI bots.
  • Aggressive rate-limiting (Security -> Rate limiting). Rules tuned for human traffic catastrophically block bot crawl bursts.

THE FIX SEQUENCE.

In order, applied surgically:

THE BOTTOM LINE.

Cloudflare is excellent infrastructure with terrible defaults for AI visibility. The block is silent, the symptoms are invisible in standard analytics, and most owners do not know they have it. Run the curl test on every domain you own. Twice a year. Especially after any Cloudflare onboarding or settings change.

Stop Guessing What AI Sees

MEASURE THE LEVERS
THAT ACTUALLY EXIST.

If you want this methodology applied to your specific site - your real logs, your real citation data, your real fix list - the audit is the productized way to do it.