mirror of https://git.sr.ht/~seirdy/seirdy.one synced 2024-11-23 12:52:10 +00:00

Compare commits

3 commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Seirdy | fcbd50e7ab | Add humans.txt | 2024-06-01 06:05:32 -04:00 |
| Seirdy | 4f28f001bf | robots.txt: remove unused anthropic directives; official docs show the right opt-out signal | 2024-06-01 05:35:15 -04:00 |
| Seirdy | 03270da3c7 | Move mirrors.txt from static to content dir | 2024-06-01 03:06:36 -04:00 |

6 changed files with 24 additions and 9 deletions

BIN
assets/p/b/humanstxt.png Normal file

Binary file not shown. Size: 733 B

15
content/humans.txt Normal file

@ -0,0 +1,15 @@
/* TEAM */
Webmaster: Plural system of Seirdy, Cadmus, AKA Rohan Kumar.
Contact: see /about/
/* THANKS */
Daniel Aleksandersen's <https://www.ctrl.blog> for Microdata usage.
Athena Lilith Martin for more markup inspiration.
The IndieWeb community for showing me the power of personal sites (Aaron Parecki in particular)
/* SITE */
Language: US English
Doctype: Polyglot, HTML5 or XHTML5
Standards: HTML5, CSS3, WAI-ARIA, DPUB-ARIA, Atom, various RSS/Atom extensions, Microformats2, Open Graph Protocol
Components: Handwritten XHTML5 and CSS, Hugo, Nginx. More info at <https://seirdy.one/meta/build-this-site/#dependencies> and <https://seirdy.one/meta/#how-i-run-this-site>
Site design standards: <https://seirdy.one/meta/site-design/>

View file

@ -59,6 +59,7 @@ Read more about the design of this site in my [site design standards page]({{<re
<a href="https://validator.w3.org/feed/check.cgi?url=https%3A%2F%2Fseirdy.one%2Fatom.xml">{{<picture name="b/valid-atom" alt="Valid Atom feed." width="162" height="62" class="pix">}}</a>
<a href="https://validator.w3.org/feed/check.cgi?url=https%3A%2F%2Fseirdy.one%2Findex.xml">{{<picture name="b/valid-rss" alt="Valid RSS feed." width="162" height="62" class="pix">}}</a>
<a href="{{<relref "/meta/wcag-conformance.md">}}">{{<picture name="b/wcag22" alt="W3C WAI-AA WCAG 2.2." width="162" height="62" class="pix">}}</a>
<a href="https://web.archive.org/web/20240507225939/https://humanstxt.org/">{{<picture name="b/humanstxt" alt="humans dot TXT" width="162" height="62" class="pix">}}</a>
### Software that seirdy.one runs on
@ -187,6 +188,7 @@ The following badges come from the sites they link to:
- Atom and RSS validation badges
- Web-14 censorship panda
- AGPLv3 license
- humans.txt
- Say no to Web3
- Web Interoperability Pledge
- Standards NOW!

View file

@ -45,6 +45,11 @@ These are roughly in descending order of importance:
## Improvements already made
### June 2024
- Added support for [No-Vary-Search](https://wicg.github.io/nav-speculation/no-vary-search.html)
- Added humans.txt.
### May 2024
- Wrapped `details` elements in borders to distinguish them in print media and to emphasize that the contents can be toggled.

View file

@ -11,8 +11,7 @@ Disallow: /
Allow: /ads.txt
Allow: /app-ads.txt
-# Enabling our crawler to access your site offers several significant benefits
-# to you as a publisher. By allowing us access, you enable the maximum number
+# By allowing us access, you enable the maximum number
# of advertisers to confidently purchase advertising space on your pages. Our
# comprehensive data insights help advertisers understand the suitability and
# context of your content, ensuring that their ads align with your audience's
@ -100,13 +99,7 @@ Disallow: /
User-agent: Google-Extended
Disallow: /
-# There isn't any public documentation for this AFAICT.
-# Reuters thinks this works so I might as well give it a shot.
-User-agent: anthropic-ai
-User-agent: Claude-Web
-Disallow: /
-# Extremely aggressive crawling with no documentation. people had to email the
-# company about this for robots.txt guidance.
+# Anthropic-AI crawler posted guidance after a long period of crawling without opt-out documentation: <https://support.anthropic.com/en/articles/8896518-does-anthropic-crawl-data-from-the-web-and-how-can-site-owners-block-the-crawler>
User-agent: ClaudeBot
Disallow: /
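
One way to sanity-check the effect of the robots.txt change is to feed the resulting directives to a parser and ask which user agents may fetch a page. The sketch below uses Python's standard `urllib.robotparser`. The `ROBOTS_TXT` excerpt and the `is_allowed` helper are illustrative assumptions, not part of this repository: the excerpt contains only the two groups visible in the hunk above, and the real file has other groups that could change the answers. A crawler's own parser may also interpret the file differently.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt: only the groups visible in the hunk above.
# The real robots.txt contains more groups that are not reproduced here.
ROBOTS_TXT = """\
User-agent: Google-Extended
Disallow: /

User-agent: ClaudeBot
Disallow: /
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if `user_agent` may fetch `url` under `robots_txt`."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # anthropic-ai and Claude-Web are included because the change drops them.
    for agent in ("ClaudeBot", "Google-Extended", "anthropic-ai", "Claude-Web"):
        print(agent, is_allowed(agent, "https://seirdy.one/"))
```

Under this excerpt alone, `ClaudeBot` and `Google-Extended` are refused, while the removed `anthropic-ai` and `Claude-Web` names fall through to the default of being allowed, since no group names them.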