syndicate

2025-05-17 20:43:51 +00:00 · 2023-04-21 22:45:43 -07:00 · 2023-04-21 22:45:43 -07:00 · e4592387a3
commit e4592387a3
parent 2a8d60b896
1 changed files with 3 additions and 3 deletions
--- a/content/notes/opting-out-of-llm-indexing.md
+++ b/content/notes/opting-out-of-llm-indexing.md
@ -6,9 +6,9 @@ replyTitle: "“the secret list of websites”"
 replyType: "BlogPosting"
 replyAuthor: "Chris Coyier"
 replyAuthorURI: "https://chriscoyier.net/"
-#syndicatedCopies:
+syndicatedCopies:
-#    - title: 'The Fediverse'
+    - title: 'The Fediverse'
-#      url: ''
+      url: 'https://pleroma.envs.net/notice/AUttq9kpOmeYZDHRTc'
 ---
 I added an entry to [my robots.txt](https://seirdy.one/robots.txt) to block ChatGPT's crawler, but blocking crawling isn't the same as blocking indexing; it looks like Google chose to use the [Common Crawl](https://commoncrawl.org/) for this and sidestep the need to do crawling of its own. That's a strange decision; after all, Google has a much larger proprietary index at its disposal.