Custom Web Audits
Run Audit Demo Audit Audit Types Free Tools Pricing What We Analyze 🔒 Login
← All free tools
🤖

AI Bot Access Checker

Check whether your robots.txt lets the AI crawlers — GPTBot, ClaudeBot, Google-Extended, PerplexityBot, CCBot and more — read your site. Decide whether you want to appear in AI answers or keep your content out of training.

⚡ Interactive demo — sample data
3 of 12 AI crawlers are blocked in this sample robots.txt — training bots blocked, search/answer bots allowed.
OAI-SearchBot — Allowed — OpenAI ChatGPT search (sends citation traffic) Looks good
ChatGPT-User — Allowed — OpenAI live browsing when a user asks Looks good
GPTBot — Blocked — OpenAI model-training crawler (Disallow: /) Issue
CCBot — Blocked — Common Crawl, feeds many AI models (Disallow: /) Issue
PerplexityBot — Allowed — Perplexity answer engine Looks good
Google-Extended — not blocked here; remember it only affects Gemini/AI training, never Google Search indexing Warning

About this tool

Check whether your robots.txt lets the AI crawlers — GPTBot, ClaudeBot, Google-Extended, PerplexityBot, CCBot and more — read your site. Decide whether you want to appear in AI answers or keep your content out of training.

How it works

Enter your website URL or domain
Paste any domain and run the check. We fetch your live robots.txt from the root of the site (e.g. https://example.com/robots.txt) — the same file every AI crawler reads before it decides what it's allowed to download.
See which AI bots are allowed or blocked
We parse the User-agent and Disallow rules and check each known AI crawler — GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, Google-Extended, PerplexityBot, CCBot, Bytespider, Amazonbot, Applebot-Extended and meta-externalagent — against the rule that applies to it. A bot-specific group wins over the catch-all User-agent: *, exactly like the real crawlers resolve it.
Decide your AI strategy and update robots.txt
Each bot is marked Allowed or Blocked with a note on who runs it. Use that to make a deliberate choice — appear in AI answers, or keep your content out of AI training — then edit robots.txt and re-run to confirm the rules land the way you intended.

What we check

Common issues we catch

Where this matters

Frequently asked questions

Will blocking AI bots hurt my Google rankings?
No. The AI-training tokens are separate from the crawler that indexes you for Search. Blocking GPTBot, ClaudeBot, CCBot or Google-Extended has no effect on your normal Google Search rankings. It only changes whether your content can be used by AI systems — which is a content-strategy choice, not an SEO penalty.
Should I block AI crawlers or allow them?
It depends on your goal. Allowing them helps your brand appear in ChatGPT, Gemini, Perplexity and other AI answers — increasingly a real source of referral traffic. Blocking them keeps your content out of AI training and answer engines. Many sites split the difference: allow the search/answer bots that send traffic, block the pure training crawlers.
Does Google-Extended stop Google from indexing my site?
No. Google-Extended only governs whether your content trains Gemini and powers Google's AI features. Googlebot — the crawler behind Google Search — is controlled by a different token entirely. You can block Google-Extended and remain fully indexed in regular Google results.
Is robots.txt enough to actually keep AI out of my content?
Not entirely. robots.txt is the Robots Exclusion Protocol — a request that well-behaved crawlers honor voluntarily. Reputable bots like GPTBot and ClaudeBot obey it, but it isn't enforced. A crawler can ignore it, and some AI agents use a generic browser user-agent with no identifying token at all. For hard enforcement, block at the server or WAF level.
What's the difference between GPTBot and ChatGPT-User?
GPTBot is the automated crawler that gathers content which may be used to train OpenAI's models. ChatGPT-User only fetches a specific page when a person explicitly asks ChatGPT to visit it — it's not bulk crawling or training. OAI-SearchBot is a third token that indexes pages for citations in ChatGPT search.
How do I block a specific AI bot in robots.txt?
Add a group naming the bot's exact user-agent token followed by a full disallow — for example: User-agent: GPTBot then Disallow: /. To allow it everywhere, use Disallow: (empty) or simply leave the bot out of any blocking group. The token name must match exactly what the crawler announces or the rule won't apply.
How long until my robots.txt change takes effect?
Crawlers re-fetch robots.txt periodically rather than on every request, so a change isn't instant. As a rough guide, OpenAI notes it can take around 24 hours for its search systems to pick up a robots.txt update. Other crawlers vary, but most reflect changes within a day or two.
Why does the checker say a bot is allowed when I tried to block it?
The usual causes are a mismatched token name, a more specific (and permissive) group for that bot higher in the file overriding your wildcard rule, or the disallow not being a full Disallow: / for the whole site. Each crawler obeys only the single most specific User-agent group that names it — fix the token or the group and re-run.

This is one of several free SEO tools from Custom Web Audits. For a complete, prioritized analysis of your whole website, run a full audit.