From 7abbbbacc978009aa228ad2ab355d9f86b8d0270 Mon Sep 17 00:00:00 2001 From: Rohan Kumar Date: Thu, 22 Feb 2024 00:05:10 -0500 Subject: [PATCH] Add greppr search engine --- content/posts/search-engines-with-own-indexes.gmi | 3 +++ content/posts/search-engines-with-own-indexes.md | 3 +++ 2 files changed, 6 insertions(+) diff --git a/content/posts/search-engines-with-own-indexes.gmi b/content/posts/search-engines-with-own-indexes.gmi index f2e18d0..c72434c 100644 --- a/content/posts/search-engines-with-own-indexes.gmi +++ b/content/posts/search-engines-with-own-indexes.gmi @@ -95,6 +95,9 @@ These engines pass most of the tests listed in the "methodology" section. All of => https://github.com/RimoChan/sese-engine SeSe back-end Python code => https://github.com/YunYouJun/sese-engine-ui SeSe-UI Vue-based front-end +* greppr: its tagline is "Search the Internet with no filters, no tracking, no ads." At the time of writing, it has over 3 million pages indexed. It's surprisingly good at finding interesting new results for broad short-tail queries, if you're willing to scroll far enough down the page. It appears to be good at finding recent pages. +=> https://greppr.org/ greppr + Yep supports Open Graph and some JSON-LD at the moment. A look through the source code for Alexandria and Gigablast didn't seem to reveal the use of any structured data. The surprising quality of results from SeSe and Right Dao seems influenced by the crawlers' high-quality starting locations (e.g. Wikipedia). ### Smaller indexes, hit-and-miss diff --git a/content/posts/search-engines-with-own-indexes.md b/content/posts/search-engines-with-own-indexes.md index dd34a5d..2b65ff7 100644 --- a/content/posts/search-engines-with-own-indexes.md +++ b/content/posts/search-engines-with-own-indexes.md @@ -123,6 +123,9 @@ These engines pass most of the tests listed in the "methodology" section. All of [SeSe Engine](https://sese.yyj.moe/) : Although it's a Chinese engine, its index seems to have a large-enough proportion of English content to fit here. The engine is open-source; see the [SeSe back-end Python code](https://github.com/RimoChan/sese-engine) and [the SeSe-ui Vue-based front-end](https://github.com/YunYouJun/sese-engine-ui). It has surprisingly good results for such a low-budget project. Each result is annotated with detailed ranking metadata such as keyword relevance and backlink weight. Discovered in my access logs. +[greppr](https://greppr.org/) +: Its tagline is "Search the Internet with no filters, no tracking, no ads." At the time of writing, it has over 3 million pages indexed. It's surprisingly good at finding interesting new results for broad short-tail queries, if you're willing to scroll far enough down the page. It appears to be good at finding recent pages. + Yep supports Open Graph and some JSON-LD at the moment. A look through the source code for Alexandria and Gigablast didn't seem to reveal the use of any structured data. The surprising quality of results from SeSe and Right Dao seems influenced by the crawlers' high-quality starting location: Wikipedia. ### Smaller indexes, hit-and-miss {#smaller-indexes-hit-and-miss}