home.social

#marginaliasearch — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #marginaliasearch, aggregated by home.social.

  1. Using Marginalia Search’s Backlink Search

    Marginalia Search may well be the best open source independent search engine on the web. Here, we look at how it can be used to find backlinks to indexed domains.

    thenewleafjournal.com/using-ma

    #MarginaliaSearch #SearchEngines #Backlinks #SmallWeb

  2. Seems stract.com ils no longer operational, and the developper is no working on the project any longer...
    That's a pity 😥 , I liked the ideas behind this search engine, although the index was never big enough to give really relevant results.

    #marginaliasearch is an alternative, it seems.

    I'm not sure what it takes to run a web crawler... Or how big the index becomes..
    #yacy has a decentralized approach, not sure it works

    #searchengine #souverainetenumerique #foss #logiciellibre

  3. Do you use #mojeek, #presearch, #kagi, #marginaliasearch, #yacy or something else ??? There is no false answer! 😌

    Tell us why and let us discover new tools to search online! 🙂

    #websearch #websearchengine #boostswelcome #boost

  4. social.emucafe.org/naferrell/b

    I read an interesting Hacker News comment by user dumbfounder (great username!) on Hacker News today. I excerpt the pertinent part below:

    I created a search engine that crawled the web way back in 2003. I used a proper user agent that included my email address. I got SO many angry emails about my crawler, which played as nice as I was able to make it play. Which was pretty nice I believe. If it’s not Google people didn’t want it. That’s a good way to prevent anyone from ever competing with Google.

    dumbfounder

    I have had problems with bad crawlers (especially bad AI cralwers) on my sites. At the same time, dumbfounder highlights the reverse side of the coin. Many sites block good crawlers such as robots.txt-respecting crawlers for indepdent search engines. While all webmasters are free to control access to their sites as they see fit, allowing Google and other select big tech search crawlers while excluding small and independent search crawlers both limits search diversity and prevents people who may rely on small or niche search engines such as Mojeek, Marginalia, or Seznam from discovering potentially interesting writing. I previously published an article on this issue advocating for webmasters who want to support an open web and search engine diversity to ensure that good crawlers from independent search engines and directories can access their sites.

    #bots #googleSearch #marginaliaSearch #mojeek #searchEngines #seznam

  5. @abucci time for web sites to bring back browser wars era badges, but instead of saying "Best viewed with Netscape Navigator" it might say "Best found with Duck Duck Go".... Oh wait, they have an AI "assist" right at the top of their page too. Hmm

    "Best found with ${NON_AI_SEARCH_ENGINE}" where NON_AI_SEARCH_ENGINE is one of Start page, Searx, Qwant, Marginalia, Mwmbl...

    #Googe #DuckDuckG #StartPage #Qwant #MarginaliaSearch #Mwmbl