From 5612220b77ab9e2eb4b4695f6b0e8dbc7affb4d5 Mon Sep 17 00:00:00 2001 From: Rohan Kumar Date: Fri, 25 Feb 2022 23:43:22 -0800 Subject: [PATCH] Mention exclusions --- content/posts/search-engines-with-own-indexes.gmi | 8 +++++++- content/posts/search-engines-with-own-indexes.md | 11 +++++++++-- 2 files changed, 16 insertions(+), 3 deletions(-) diff --git a/content/posts/search-engines-with-own-indexes.gmi b/content/posts/search-engines-with-own-indexes.gmi index 6e5c7f2..3f8753a 100644 --- a/content/posts/search-engines-with-own-indexes.gmi +++ b/content/posts/search-engines-with-own-indexes.gmi @@ -82,7 +82,7 @@ These are large engines that pass all the above tests and more. ### Smaller indexes, relevant results -These engines pass most of the tests listed in the “methodology” section. +These engines pass most of the tests listed in the "methodology" section. All of them seem relatively privacy-friendly. * Right Dao: very fast, good results. Passes the tests fairly well. It plans on including query-based ads if/when its userbase grows.⁸ @@ -291,6 +291,12 @@ These engines were originally included in the article, but have since been disco => https://xangis.com/the-wbsrch-experiment/ The Wbsrch Experiment => https://gowiki.com Gowiki +## Exclusions + +Two engines were excluded from this list for having a far-right focus. + +One engine was excluded because it seems to be built using cryptocurrency in a way I'd rather not support. + ## Acknowledgements Some of this content came from the Search Engine Map and Search Engine Party. A few web directories also proved useful. diff --git a/content/posts/search-engines-with-own-indexes.md b/content/posts/search-engines-with-own-indexes.md index 732f5cc..87a0d78 100644 --- a/content/posts/search-engines-with-own-indexes.md +++ b/content/posts/search-engines-with-own-indexes.md @@ -90,7 +90,7 @@ These are large engines that pass all the above tests and more. ### Smaller indexes, relevant results -These engines pass most of the tests listed in the "methodology" section. +These engines pass most of the tests listed in the "methodology" section. All of them seem relatively privacy-friendly. - [Right Dao](https://rightdao.com): very fast, good results. Passes the tests fairly well. It plans on including query-based ads if/when its user base grows.[^8] - [Gigablast](https://gigablast.com/): It's been around for a while and also sports a classic web directory. Searches are a bit slow, and it charges to submit sites for crawling. It powers [Private.sh](https://private.sh). Gigablast is tied with Right Dao for quality. @@ -142,7 +142,7 @@ These indexing search engines don’t have a Google-like “ask me anything” e ### Small or non-commercial Web - Wiby: [wiby.me](https://wiby.me) and [wiby.org](https://wiby.org): I love this one. It focuses on smaller independent sites that capture the spirit of the "early" web. It's more focused on "discovering" new interesting pages that match a set of keywords than finding a specific resources. I like to think of Wiby as an engine for surfing, not searching. Runnaroo occasionally features a hit from Wiby. If you have a small site or blog that isn't very "commercial", consider submitting it to the index. -- [Marginalia Search](https://search.marginalia.nu/): A recent addition similar to Wiby, and *my favorite entry on this page*. It has its own crawler but is strongly biased towards non-commercial, personal, and/or minimal sites. It's a great response to the increasingly SEO-spam-filled SERPs of GBY. Partially powers Teclis, which in turn partially powers Kagi. +- [Marginalia Search](https://search.marginalia.nu/): A recent addition similar to Wiby, and _my favorite entry on this page_. It has its own crawler but is strongly biased towards non-commercial, personal, and/or minimal sites. It's a great response to the increasingly SEO-spam-filled SERPs of GBY. Partially powers Teclis, which in turn partially powers Kagi. - [Search My Site](https://searchmysite.net): Similar to Wiby, but only indexes user-submitted personal and independent sites. It optionally supports IndieAuth. - [Teclis](http://teclis.com/): A project by the creator of Kagi search. Uses its own crawler that measures content blocked by uBlock Origin, and extracts content with the open-source article scrapers Trafilatura and Readability.js. This is quite an interesting approach: tracking blocked elements discourages tracking and advertising; using Trafilatura and Readability.js encourages the use of semantic HTML and Semantic Web standards such as [microformats](https://microformats.org/), [microdata](https://html.spec.whatwg.org/multipage/microdata.html), and [RDFa](https://www.w3.org/TR/rdfa-primer/). It claims to also use some results from Marginalia. @@ -204,6 +204,13 @@ These engines were originally included in the article, but have since been disco - [wbsrch](https://wbsrch.com/): In addition to its generalist search, it also had many other utilities related to domain name statistics. Failed multiple tests. Its index was a bit dated; it had an old backlog of sites it hadn't finished indexing. It also had several dedicated per-language indexes. - [Gowiki](https://gowiki.com): Very young, small index, but showed promise. I discovered this in the seirdy.one access logs. It was only available in the US. Seems down as of early 2022. +Exclusions +---------- + +Two engines were excluded from this list for having a far-right focus. + +One engine was excluded because it seems to be built using cryptocurrency in a way I'd rather not support. + Acknowledgements ----------------