From a5bd284644f8e1d7828ba2ce2b6a0177972869f8 Mon Sep 17 00:00:00 2001 From: Rohan Kumar Date: Sun, 21 Apr 2024 23:30:01 -0400 Subject: [PATCH] Add Clew search engine --- content/posts/search-engines-with-own-indexes.gmi | 3 +++ content/posts/search-engines-with-own-indexes.md | 3 +++ 2 files changed, 6 insertions(+) diff --git a/content/posts/search-engines-with-own-indexes.gmi b/content/posts/search-engines-with-own-indexes.gmi index 3fb761f..02301a0 100644 --- a/content/posts/search-engines-with-own-indexes.gmi +++ b/content/posts/search-engines-with-own-indexes.gmi @@ -217,6 +217,9 @@ These indexing search engines don’t have a Google-like “ask me anything” e => http://teclis.com/ Teclis => https://kagifeedback.org/d/1838-teclis-is-broken Teclis free version shutdown notice +* Clew: a FOSS new engine with a small index of several thousand pages. It focuses on independent content and downranks ads and trackers; there seems to be a real focus on quality over quantity, which makes it excellent for short-tail searches (especially around technical concepts). Ranking is more egalitarian than other engines, making it better for discovery and surfing than research. It's designed to be small and lightweight, with a compact index. Discovered in the seirdy.one access logs. +=> https://clew.se/ Clew + ### Site finders These engines try to find a website, typically at the domain-name level. They don't focus on capturing particular pages within websites. diff --git a/content/posts/search-engines-with-own-indexes.md b/content/posts/search-engines-with-own-indexes.md index 78f35e6..fa8e0cc 100644 --- a/content/posts/search-engines-with-own-indexes.md +++ b/content/posts/search-engines-with-own-indexes.md @@ -237,6 +237,9 @@ These indexing search engines don’t have a Google-like “ask me anything” e [Teclis](http://teclis.com/) : A project by the creator of Kagi search. Uses its own crawler that measures content blocked by uBlock Origin, and extracts content with the open-source article scrapers Trafilatura and Readability.js. This is quite an interesting approach: tracking blocked elements discourages tracking and advertising; using Trafilatura and Readability.js encourages the use of semantic HTML and Semantic Web standards such as [microformats](https://microformats.org/), [microdata](https://html.spec.whatwg.org/multipage/microdata.html), and [RDFa](https://www.w3.org/TR/rdfa-primer/). It claims to also use some results from Marginalia. [The Web interface has been shut down](https://kagifeedback.org/d/1838-teclis-is-broken/2), but its standalone API is still available for Kagi customers. +[Clew](https://clew.se/) +: a FOSS new engine with a small index of several thousand pages. It focuses on independent content and downranks ads and trackers; there seems to be a real focus on quality over quantity, which makes it excellent for short-tail searches (especially around technical concepts). Ranking is more egalitarian than other engines, making it better for discovery and surfing than research. It's designed to be small and lightweight, with a compact index. Discovered in the seirdy.one access logs. + ### Site finders These engines try to find a website, typically at the domain-name level. They don't focus on capturing particular pages within websites.