home.social

#saucenao — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #saucenao, aggregated by home.social.

  1. I'm really not happy with how the #antitrust case went against #Google. The fed should've targetted its real #monopoly in #browser technology and video hosting via #YouTube.

    As much as I hate to admit it, Google does have a point when it defended itself from accusations of being a monopolist in
    #search. Users can easily change their default engine, and I've done it always when I install a browser which has Google as its default. In fact I find #DuckDuckGo which is based in #Bing to be better sometimes in web search, and a 100x better in image search. I only use Google for its reverse image search, but even then I complement it with #TinEye and #SauceNAO.

    This decision is not only an insult to the users' intelligence but also a distraction from real issues.

    I guess the only good thing that can come out of this is that
    #Mozilla may be forced to use another default search for #Firefox, which means less funding. Could help eventually defeat Mozilla's monopoly on "alternative browsers"... ​:seija_coffee:​

    #openweb

  2. Hmm I probably have the most ridiculous #robotstxt for a #Misskey instance right now lol. I just want to let #Mojeek and #Marginalia crawl #Makai and make sure to keep out #Google and the AI scrapers... ​:satrithink:​

    If there are other user-agents of independent
    #searchengines I should allow in https://makai.chaotic.ninja/robots.txt, please let me know! I'm actually searching #SauceNAO, #TinEye, and #IQDB's #useragent so I can let them fetch our media for their reverse image search.

    User-Agent: MojeekBot
    User-Agent: FeedFetcher-Mojeek
    User-Agent: search.marginalia.nu
    Allow: /
    Allow: /notes
    Disallow: /admin
    Disallow: /settings
    Disallow: /my/
    
    User-Agent: *
    User-Agent: Googlebot
    User-Agent: Google-Extended
    User-Agent: GoogleOther
    User-Agent: AdsBot-Google
    User-Agent: AdsBot-Google-Mobile
    User-Agent: Mediapartners-Google
    User-Agent: CCBot
    User-Agent: ChatGPT-User
    User-Agent: GPTBot
    User-Agent: Omgilibot
    User-Agent: omgili
    User-Agent: FacebookBot
    User-agent: Twitterbot
    User-Agent: cohere-ai
    User-Agent: anthropic-ai
    User-Agent: Bytespider
    User-Agent: Amazonbot
    User-Agent: Applebot
    User-Agent: PerplexityBot
    User-Agent: YouBot
    User-Agent: AwarioRssBot
    User-Agent: AwarioSmartBot
    User-Agent: ClaudeBot
    User-Agent: Claude-Web
    User-Agent: DataForSeoBot
    User-Agent: FriendlyCrawler
    User-Agent: ImagesiftBot
    User-Agent: magpie-crawler
    User-Agent: Meltwater
    User-Agent: peer39_crawler
    User-Agent: PiplBot
    User-Agent: Seekr
    Disallow: /
    
    # todo: sitemap

    #sysadmin #fediadmin