diff --git a/content/posts/search-engines-with-own-indexes.gmi b/content/posts/search-engines-with-own-indexes.gmi
index a01cd7b..3661bf8 100644
--- a/content/posts/search-engines-with-own-indexes.gmi
+++ b/content/posts/search-engines-with-own-indexes.gmi
@@ -63,6 +63,7 @@ These are large engines that pass all my standard tests and more.
 * Netzzappen
 * You.com¹¹
 * Partially powers MetaGer by default; this can be turned off
+* ChatGPT Search
 * At this point, I mostly stopped adding Bing-based search engines. There are just too many.
 
 3. Yandex: originally a Russian search engine, it now has an English version. Some Russian results bleed into its English site. Like Bing, it allows submitting pages and sitemaps for crawling using the IndexNow API. Powers:
@@ -132,7 +133,7 @@ These engines fail badly at a few important tests. Otherwise, they seem to work
 * Secret Search Engine Labs: Very small index with very little SEO spam; it toes the line between a "search engine" and a "surf engine". It's best for reading about broad topics that would otherwise be dominated by SEO spam, thanks to its CashRank algorithm. Allows site submission.
 * Gabanza: a search engine from a hosting company. I found few details abou the search engine itself, and the index was small, but it was suitable for discovering new pages related to short broad queries.
 * Jambot: docs, blog posts, etc. have not been updated since around 2006 but the engine continues to crawl and index new pages. Discovered in my access logs. Has a bias towards older content.
-* search.dxhub.de: while Gigablast seems dead, a version of it was open-source. This based on that version of Gigablast. Its index is small but results are still useful for surfing new unseen corners of short-tail queries.
+* search.dxhub.de: while Gigablast seems dead, a version of it was open-source. This based on that version of Gigablast. Its index is small but results are still useful for surfing new unseen corners of short-tail queries. Found via my access logs.
 
 => https://github.com/chatnoir-eu ChatNoir source code (GitHub)
 => https://groups.google.com/g/common-crawl/c/3o2dOHpeRxo/m/H2Osqz9dAAAJ ChatNoir Announcement
diff --git a/content/posts/search-engines-with-own-indexes.md b/content/posts/search-engines-with-own-indexes.md
index 991da6b..15967a0 100644
--- a/content/posts/search-engines-with-own-indexes.md
+++ b/content/posts/search-engines-with-own-indexes.md
@@ -94,6 +94,7 @@ Bing
 - Netzzappen
 - You.com[^7]
 - Partially powers MetaGer by default; this can be turned off
+ - [ChatGPT Search](https://help.openai.com/en/articles/9237897-chatgpt-search)
 - At this point, I mostly stopped adding Bing-based search engines. There are just too many.
 
 Yandex
@@ -168,7 +169,7 @@ These engines fail badly at a few important tests. Otherwise, they seem to work
 : Docs, blog posts, etc. have not been updated since around 2006 but the engine continues to crawl and index new pages. Discovered in my access logs. Has a bias towards older content.
 
 [search.dxhub.de](http://search.dxhub.de/?c=main)
-: while Gigablast seems dead, a version of it was open-source. This based on that version of Gigablast. Its index is small but results are still useful for surfing new unseen corners of short-tail queries.
+: while Gigablast seems dead, a version of it was open-source. This based on that version of Gigablast. Its index is small but results are still useful for surfing new unseen corners of short-tail queries. Found via my access logs.
 
 ### Fledgling engines