Mirror of https://git.sr.ht/~seirdy/seirdy.one, synced 2024-12-24 17:52:11 +00:00
robots.txt: remove unused anthropic directives
official docs show the right opt-out signal
parent 03270da3c7
commit 4f28f001bf
1 changed file with 2 additions and 9 deletions
@@ -11,8 +11,7 @@ Disallow: /
 Allow: /ads.txt
 Allow: /app-ads.txt
 
-# Enabling our crawler to access your site offers several significant benefits
-# to you as a publisher. By allowing us access, you enable the maximum number
+# By allowing us access, you enable the maximum number
 # of advertisers to confidently purchase advertising space on your pages. Our
 # comprehensive data insights help advertisers understand the suitability and
 # context of your content, ensuring that their ads align with your audience's
@@ -100,13 +99,7 @@ Disallow: /
 User-agent: Google-Extended
 Disallow: /
 
-# There isn't any public documentation for this AFAICT.
-# Reuters thinks this works so I might as well give it a shot.
-User-agent: anthropic-ai
-User-agent: Claude-Web
-Disallow: /
-# Extremely aggressive crawling with no documentation. people had to email the
-# company about this for robots.txt guidance.
+# Anthropic-AI crawler posted guidance after a long period of crawling without opt-out documentation: <https://support.anthropic.com/en/articles/8896518-does-anthropic-crawl-data-from-the-web-and-how-can-site-owners-block-the-crawler>
 User-agent: ClaudeBot
 Disallow: /
 
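One way to sanity-check a change like this is to parse the resulting directives and ask which user agents they block. The sketch below uses Python's standard `urllib.robotparser`; the `ROBOTS_FRAGMENT` string and the `is_blocked` helper are hypothetical names introduced for illustration, and the fragment only reproduces the two groups visible in the second hunk, not the full robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a small fragment mirroring the post-commit groups shown in the
# diff. The real robots.txt contains many more groups that are not modelled.
ROBOTS_FRAGMENT = """\
User-agent: Google-Extended
Disallow: /

User-agent: ClaudeBot
Disallow: /
"""


def is_blocked(robots_text: str, user_agent: str,
               url: str = "https://example.org/") -> bool:
    """Return True if robots_text disallows user_agent from fetching url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is made.
    parser.parse(robots_text.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("ClaudeBot", "Google-Extended", "anthropic-ai", "Claude-Web"):
        status = "blocked" if is_blocked(ROBOTS_FRAGMENT, agent) else "allowed"
        print(f"{agent}: {status}")
```

Within this fragment, `ClaudeBot` and `Google-Extended` are reported as blocked, while the removed `anthropic-ai` and `Claude-Web` names are reported as allowed because no group names them and the fragment has no wildcard group. The full file may behave differently if another group (for example a `User-agent: *` group) also applies.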