robots.txt: remove unused anthropic directives

official docs show the right opt-out signal
2025-05-17 20:43:51 +00:00 · 2024-06-01 05:35:15 -04:00 · 2024-06-01 05:35:15 -04:00 · 4f28f001bf
commit 4f28f001bf
parent 03270da3c7
1 changed files with 2 additions and 9 deletions
--- a/static/robots.txt
+++ b/static/robots.txt
@ -11,8 +11,7 @@ Disallow: /
 Allow: /ads.txt
 Allow: /app-ads.txt

-# Enabling our crawler to access your site offers several significant benefits
-# to you as a publisher. By allowing us access, you enable the maximum number
+# By allowing us access, you enable the maximum number
 # of advertisers to confidently purchase advertising space on your pages. Our
 # comprehensive data insights help advertisers understand the suitability and
 # context of your content, ensuring that their ads align with your audience's
@ -100,13 +99,7 @@ Disallow: /
 User-agent: Google-Extended
 Disallow: /

-# There isn't any public documentation for this AFAICT.
-# Reuters thinks this works so I might as well give it a shot.
-User-agent: anthropic-ai
-User-agent: Claude-Web
-Disallow: /
-# Extremely aggressive crawling with no documentation. people had to email the
-# company about this for robots.txt guidance.
+# Anthropic-AI crawler posted guidance after a long period of crawling without opt-out documentation: <https://support.anthropic.com/en/articles/8896518-does-anthropic-crawl-data-from-the-web-and-how-can-site-owners-block-the-crawler>
 User-agent: ClaudeBot
 Disallow: /