mirror of https://git.sr.ht/~seirdy/seirdy.one synced 2024-09-19 20:02:10 +00:00

Add greppr search engine

Rohan Kumar 2024-02-22 00:05:10 -05:00
parent 6a93bfa103
commit 7abbbbacc9
No known key found for this signature in database
GPG key ID: 1E892DB2A5F84479
2 changed files with 6 additions and 0 deletions

@@ -95,6 +95,9 @@ These engines pass most of the tests listed in the "methodology" section. All of
=> https://github.com/RimoChan/sese-engine SeSe back-end Python code
=> https://github.com/YunYouJun/sese-engine-ui SeSe-UI Vue-based front-end
* greppr: its tagline is "Search the Internet with no filters, no tracking, no ads." At the time of writing, it has over 3 million pages indexed. It's surprisingly good at finding interesting new results for broad short-tail queries, if you're willing to scroll far enough down the page. It appears to be good at finding recent pages.
=> https://greppr.org/ greppr
Yep supports Open Graph and some JSON-LD at the moment. A look through the source code for Alexandria and Gigablast didn't seem to reveal the use of any structured data. The surprising quality of results from SeSe and Right Dao seems influenced by the crawlers' high-quality starting locations (e.g. Wikipedia).
### Smaller indexes, hit-and-miss

@@ -123,6 +123,9 @@ These engines pass most of the tests listed in the "methodology" section. All of
[SeSe Engine](https://sese.yyj.moe/)
: Although it's a Chinese engine, its index seems to have a large-enough proportion of English content to fit here. The engine is open-source; see the [SeSe back-end Python code](https://github.com/RimoChan/sese-engine) and [the SeSe-ui Vue-based front-end](https://github.com/YunYouJun/sese-engine-ui). It has surprisingly good results for such a low-budget project. Each result is annotated with detailed ranking metadata such as keyword relevance and backlink weight. Discovered in my access logs.
[greppr](https://greppr.org/)
: Its tagline is "Search the Internet with no filters, no tracking, no ads." At the time of writing, it has over 3 million pages indexed. It's surprisingly good at finding interesting new results for broad short-tail queries, if you're willing to scroll far enough down the page. It appears to be good at finding recent pages.
Yep supports Open Graph and some JSON-LD at the moment. A look through the source code for Alexandria and Gigablast didn't seem to reveal the use of any structured data. The surprising quality of results from SeSe and Right Dao seems influenced by the crawlers' high-quality starting location: Wikipedia.
### Smaller indexes, hit-and-<wbr />miss {#smaller-indexes-hit-and-miss}