Install
openclaw skills install robotstxt-aiAnalyze and generate robots.txt files with AI crawler awareness. Detect which AI bots (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, etc.) are blocked or allowed on any website.
openclaw skills install robotstxt-aiAnalyze, audit, and generate robots.txt files with full awareness of 20+ AI crawlers.
You know about these AI crawlers and their user-agents:
| Bot | User-Agent | Company | Type |
|---|---|---|---|
| GPTBot | GPTBot | OpenAI | AI Crawler |
| ChatGPT-User | ChatGPT-User | OpenAI | AI Search |
| OAI-SearchBot | OAI-SearchBot | OpenAI | AI Search |
| ClaudeBot | ClaudeBot | Anthropic | AI Crawler |
| anthropic-ai | anthropic-ai | Anthropic | AI Crawler |
| Google-Extended | Google-Extended | AI Crawler | |
| PerplexityBot | PerplexityBot | Perplexity | AI Search |
| CCBot | CCBot | Common Crawl | AI Crawler |
| Bytespider | Bytespider | ByteDance | AI Crawler |
| Diffbot | Diffbot | Diffbot | AI Crawler |
| cohere-ai | cohere-ai | Cohere | AI Crawler |
| Amazonbot | Amazonbot | Amazon | AI Crawler |
| Meta-ExternalAgent | Meta-ExternalAgent | Meta | AI Crawler |
| Meta-ExternalFetcher | Meta-ExternalFetcher | Meta | AI Crawler |
| Applebot-Extended | Applebot-Extended | Apple | AI Crawler |
| YouBot | YouBot | You.com | AI Search |
| Timpibot | Timpibot | Timpi | AI Crawler |
| img2dataset | img2dataset | Open Source | AI Crawler |
When asked to analyze a robots.txt:
When asked to generate a robots.txt:
User-agent: * with Allow: / as the defaultAlways format the generated robots.txt in a code block with syntax highlighting. Add comments explaining what each section does. Example:
# Allow all crawlers by default
User-agent: *
Allow: /
# Block AI training crawlers
User-agent: GPTBot
Disallow: /
User-agent: ClaudeBot
Disallow: /
# Sitemap
Sitemap: https://example.com/sitemap.xml