mirror of
https://git.sr.ht/~seirdy/seirdy.one
synced 2024-11-23 12:52:10 +00:00
Add Clew search engine
This commit is contained in:
parent
7d68eb182a
commit
a5bd284644
2 changed files with 6 additions and 0 deletions
|
@ -217,6 +217,9 @@ These indexing search engines don’t have a Google-like “ask me anything” e
|
||||||
=> http://teclis.com/ Teclis
|
=> http://teclis.com/ Teclis
|
||||||
=> https://kagifeedback.org/d/1838-teclis-is-broken Teclis free version shutdown notice
|
=> https://kagifeedback.org/d/1838-teclis-is-broken Teclis free version shutdown notice
|
||||||
|
|
||||||
|
* Clew: a FOSS new engine with a small index of several thousand pages. It focuses on independent content and downranks ads and trackers; there seems to be a real focus on quality over quantity, which makes it excellent for short-tail searches (especially around technical concepts). Ranking is more egalitarian than other engines, making it better for discovery and surfing than research. It's designed to be small and lightweight, with a compact index. Discovered in the seirdy.one access logs.
|
||||||
|
=> https://clew.se/ Clew
|
||||||
|
|
||||||
### Site finders
|
### Site finders
|
||||||
|
|
||||||
These engines try to find a website, typically at the domain-name level. They don't focus on capturing particular pages within websites.
|
These engines try to find a website, typically at the domain-name level. They don't focus on capturing particular pages within websites.
|
||||||
|
|
|
@ -237,6 +237,9 @@ These indexing search engines don’t have a Google-like “ask me anything” e
|
||||||
[Teclis](http://teclis.com/)
|
[Teclis](http://teclis.com/)
|
||||||
: A project by the creator of Kagi search. Uses its own crawler that measures content blocked by uBlock Origin, and extracts content with the open-source article scrapers Trafilatura and Readability.js. This is quite an interesting approach: tracking blocked elements discourages tracking and advertising; using Trafilatura and Readability.js encourages the use of semantic HTML and Semantic Web standards such as [microformats](https://microformats.org/), [microdata](https://html.spec.whatwg.org/multipage/microdata.html), and [RDFa](https://www.w3.org/TR/rdfa-primer/). It claims to also use some results from Marginalia. [The Web interface has been shut down](https://kagifeedback.org/d/1838-teclis-is-broken/2), but its standalone API is still available for Kagi customers.
|
: A project by the creator of Kagi search. Uses its own crawler that measures content blocked by uBlock Origin, and extracts content with the open-source article scrapers Trafilatura and Readability.js. This is quite an interesting approach: tracking blocked elements discourages tracking and advertising; using Trafilatura and Readability.js encourages the use of semantic HTML and Semantic Web standards such as [microformats](https://microformats.org/), [microdata](https://html.spec.whatwg.org/multipage/microdata.html), and [RDFa](https://www.w3.org/TR/rdfa-primer/). It claims to also use some results from Marginalia. [The Web interface has been shut down](https://kagifeedback.org/d/1838-teclis-is-broken/2), but its standalone API is still available for Kagi customers.
|
||||||
|
|
||||||
|
[Clew](https://clew.se/)
|
||||||
|
: a FOSS new engine with a small index of several thousand pages. It focuses on independent content and downranks ads and trackers; there seems to be a real focus on quality over quantity, which makes it excellent for short-tail searches (especially around technical concepts). Ranking is more egalitarian than other engines, making it better for discovery and surfing than research. It's designed to be small and lightweight, with a compact index. Discovered in the seirdy.one access logs.
|
||||||
|
|
||||||
### Site finders
|
### Site finders
|
||||||
|
|
||||||
These engines try to find a website, typically at the domain-name level. They don't focus on capturing particular pages within websites.
|
These engines try to find a website, typically at the domain-name level. They don't focus on capturing particular pages within websites.
|
||||||
|
|
Loading…
Reference in a new issue