1
0
Fork 0
mirror of https://git.sr.ht/~seirdy/seirdy.one synced 2024-11-23 21:02:09 +00:00

Compare commits

..

No commits in common. "47ae5394565a44debeeb1b3e092969c7d2b80b2b" and "e4e020649d12ee3879c4b26e06e6b25da1a713ab" have entirely different histories.

9 changed files with 6 additions and 15 deletions

Binary file not shown.

Before

Width:  |  Height:  |  Size: 625 B

Binary file not shown.

Before

Width:  |  Height:  |  Size: 2.7 KiB

Binary file not shown.

Before

Width:  |  Height:  |  Size: 2.6 KiB

Binary file not shown.

Before

Width:  |  Height:  |  Size: 382 B

Binary file not shown.

Before

Width:  |  Height:  |  Size: 1.8 KiB

Binary file not shown.

Before

Width:  |  Height:  |  Size: 1.3 KiB

View file

@ -48,7 +48,6 @@ Read more about the design of this site in my [site design standards page]({{<re
{{<picture name="b/web11" alt="Web 1.1." width="162" height="62" class="pix">}}
{{<picture name="b/is_it_slow_say_so" alt="Is it slow? Say so!" width="162" height="62" class="pix">}}
{{<picture name="b/dark-mode" alt="Made for Dark Mode!" width="162" height="62" class="pix">}}
{{<picture name="b/mobilefriendly" alt="Mobile friendly." width="162" height="62" class="pix">}}
<a href="https://www.w3.org/developers/tools/">{{<picture name="b/heartvalidator" alt="I heart validator." width="162" height="62" class="pix">}}</a>
<a href="https://git.sr.ht/~seirdy/seirdy.one/tree/master/item/linter-configs/vnu_filter.jq">{{<picture name="b/html5" alt="W3C valid HTML5." width="162" height="62" class="pix">}}</a>
<a href="https://git.sr.ht/~seirdy/seirdy.one/tree/master/item/linter-configs/vnu_filter.jq">{{<picture name="b/valid-css" alt="W3C valid CSS." width="162" height="62" class="pix">}}</a>
@ -66,7 +65,6 @@ Read more about the design of this site in my [site design standards page]({{<re
<a href="https://nginx.org/">{{<picture name="b/nginx" alt="Nginx powered." width="162" height="62" class="pix">}}</a>
<a href="https://llvm.org/">{{<picture name="b/llvm" alt="LLVM compiler infrastructure." width="162" height="62" class="pix">}}</a>
<a href="https://www.torproject.org/">{{<picture name="b/tor" alt="The Tor Project." width="162" height="62" class="pix">}}</a>
<a href="https://gohugo.io/">{{<picture name="b/hugo" alt="Logo for the Hugo static site generator." width="162" height="62" class="pix">}}</a>
### Notes on "About this site" badges
@ -115,9 +113,8 @@ See [my "uses" page]({{<relref "/about/uses.md">}}) for some more information.
<a href="https://archive.org/?noscript=true">{{<picture name="b/internetarchive" alt="Internet Archive." width="162" height="62" class="pix">}}</a>
<a href="https://wiby.me/about/">{{<picture name="b/wiby" alt="Wiby.me search." width="162" height="62" class="pix">}}</a>
<a href="https://neovim.io/">{{<picture name="b/neovim" alt="Made with NeoVim." width="162" height="62" class="pix">}}</a>
<a href="https://github.com/yt-dlp/yt-dlp">{{<picture name="b/yt-dl" alt="The word “youtube-dl” flashes on a command prompt." width="162" height="62" class="pix">}}</a>
<a href="https://matrix.org/">{{<picture name="b/matrix" alt="Matrix." width="162" height="62" class="pix">}}</a>
<a href="https://github.com/gorhill/uBlock">{{<picture name="b/ublock-origin" alt="uBlock Origin now!" width="162" height="62" class="pix">}}</a>
<a href="https://github.com/yt-dlp/yt-dlp">{{<picture name="b/yt-dl" alt="The word “youtube-dl” flashes on a command prompt" width="162" height="62" class="pix">}}</a>
<a href="https://matrix.org/">{{<picture name="b/matrix" alt="Matrix" width="162" height="62" class="pix">}}</a>
## Friendly sites
@ -143,7 +140,6 @@ Sites that link back to me:
### Other sites
<a href="https://eightyeightthirty.one/">{{<picture name="b/eightyeightthirtyone" alt="eighty eight thirty dot one." width="162" height="62" class="pix">}}</a>
<a href="https://yesterweb.org/">{{<picture name="b/yesterweb" alt="YesterWeb: reclaim the net." width="162" height="62" class="pix">}}</a>
<a href="https://xkcd.com/">{{<picture name="b/xkcd" alt="xkcd." width="162" height="62" class="pix">}}</a>

View file

@ -121,24 +121,22 @@ These engines fail badly at a few important tests. Otherwise, they seem to work
* Burf.co: Very small index, but seems fine at ranking more relevant results higher. Allows site submission without any extra steps.
* Siik: Lacks contact info, and the ToS and Privacy Policy links are dead. Seems to have PHP errors in the backend for some of its instant-answer widgets. If you scroll past all that, it does have web results powered by what seems to be its own index. These results do tend to be somewhat relevant, but the index seems too small for more specific queries.
* ChatNoir: An experimental engine by researchers that uses the Common Crawl index. The engine is open source. There's more information in its announcement on the Common Crawl mailing list (Google Groups).
=> https://www.chatnoir.eu/ ChatNoir
=> https://commoncrawl.org/ Common Crawl
=> https://burf.co/ Burf.co
=> https://siik.co/ Siik
=> https://inetdex.com inetdex.com
* ChatNoir: An experimental engine by researchers that uses the Common Crawl index. The engine is open source. There's more information in its announcement on the Common Crawl mailing list (Google Groups).
* Secret Search Engine Labs: Very small index with very little SEO spam; it toes the line between a "search engine" and a "surf engine". It's best for reading about broad topics that would otherwise be dominated by SEO spam, thanks to its CashRank algorithm. Allows site submission.
* Gabanza: a search engine from a hosting company. I found few details abou the search engine itself, and the index was small, but it was suitable for discovering new pages related to short broad queries.
* Jambot: docs, blog posts, etc. have not been updated since around 2006 but the engine continues to crawl and index new pages. Discovered in my access logs. Has a bias towards older content.
=> https://www.chatnoir.eu/ ChatNoir
=> https://commoncrawl.org/ Common Crawl
=> https://github.com/chatnoir-eu ChatNoir source code (GitHub)
=> https://groups.google.com/g/common-crawl/c/3o2dOHpeRxo/m/H2Osqz9dAAAJ ChatNoir Announcement
=> http://www.secretsearchenginelabs.com/ Secret Search Engine Labs
=> http://www.secretsearchenginelabs.com/tech/cashrank.php CashRank Algorithm
=> https://www.gabanza.com/ Gabanza - The new search engine.
=> https://jambot.com/ Jambot
### Unusable engines, irrelevant results

View file

@ -116,7 +116,7 @@ These engines pass most of the tests listed in the "methodology" section. All of
: **My favorite generalist engine on this page.** Stract supports advanced ranking customization by allowing users ti import "optics" files, like a better version of Brave's "goggles" feature. [Stract is fully open-source](https://github.com/StractOrg/stract), with code released under an AGPL-3.0 license. The index is isn't massive but it's big enough to be a useful supplement to more major engines. Stract started with the Common Crawl index, but now uses its own crawler. Plans to add contextual ads and a subscription option for ad-free search. Discovered in my access logs.
[Right Dao](https://rightdao.com)
: Very fast, good results. Passes the tests fairly well. It plans on including query-based ads if/when its user base grows.[^8] For the past few months, its index seems to have focused more on large, established sites rather than smaller, independent ones. It seems to be a bit lacking in more recent pages.
: Very fast, good results. Passes the tests fairly well. It plans on including query-based ads if/when its user base grows.[^8]
[Alexandria](https://www.alexandria.org/)
: A pretty new "non-profit, ad free" engine, with [freely-licensed code](https://github.com/alexandria-org/alexandria). Surprisingly good at finding recent pages. Its index is built from the Common Crawl; it isn't as big as Gigablast or Right Dao but its ranking is great.
@ -164,9 +164,6 @@ These engines fail badly at a few important tests. Otherwise, they seem to work
[Gabanza](https://www.gabanza.com/)
: A search engine from a hosting company. I found few details abou the search engine itself, and the index was small, but it was suitable for discovering new pages related to short broad queries.
[Jambo](https://jambot.com/)
: Docs, blog posts, etc. have not been updated since around 2006 but the engine continues to crawl and index new pages. Discovered in my access logs. Has a bias towards older content.
### Fledgling engines
Results from these search engines don't seem particularly relevant; indexes in this category tend to be small.