mirror of https://git.sr.ht/~seirdy/seirdy.one synced 2024-11-14 01:32:11 +00:00

Add Clew search engine

Rohan Kumar 2024-04-21 23:30:01 -04:00
parent 7d68eb182a
commit a5bd284644
GPG key ID: 1E892DB2A5F84479
2 changed files with 6 additions and 0 deletions

@@ -217,6 +217,9 @@ These indexing search engines don't have a Google-like “ask me anything” e
=> http://teclis.com/ Teclis
=> https://kagifeedback.org/d/1838-teclis-is-broken Teclis free version shutdown notice
* Clew: a new FOSS engine with a small index of several thousand pages. It focuses on independent content and downranks ads and trackers; there seems to be a real focus on quality over quantity, which makes it excellent for short-tail searches (especially around technical concepts). Ranking is more egalitarian than other engines, making it better for discovery and surfing than research. It's designed to be small and lightweight, with a compact index. Discovered in the seirdy.one access logs.
=> https://clew.se/ Clew
### Site finders
These engines try to find a website, typically at the domain-name level. They don't focus on capturing particular pages within websites.

@@ -237,6 +237,9 @@ These indexing search engines don't have a Google-like “ask me anything” e
[Teclis](http://teclis.com/)
: A project by the creator of Kagi search. Uses its own crawler that measures content blocked by uBlock Origin, and extracts content with the open-source article scrapers Trafilatura and Readability.js. This is quite an interesting approach: tracking blocked elements discourages tracking and advertising; using Trafilatura and Readability.js encourages the use of semantic HTML and Semantic Web standards such as [microformats](https://microformats.org/), [microdata](https://html.spec.whatwg.org/multipage/microdata.html), and [RDFa](https://www.w3.org/TR/rdfa-primer/). It claims to also use some results from Marginalia. [The Web interface has been shut down](https://kagifeedback.org/d/1838-teclis-is-broken/2), but its standalone API is still available for Kagi customers.
[Clew](https://clew.se/)
: A new FOSS engine with a small index of several thousand pages. It focuses on independent content and downranks ads and trackers; there seems to be a real focus on quality over quantity, which makes it excellent for short-tail searches (especially around technical concepts). Ranking is more egalitarian than other engines, making it better for discovery and surfing than research. It's designed to be small and lightweight, with a compact index. Discovered in the seirdy.one access logs.
### Site finders
These engines try to find a website, typically at the domain-name level. They don't focus on capturing particular pages within websites.