home.social

#mwmbl — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #mwmbl, aggregated by home.social.

  1. What are all global web #search engines with own #index?

    I don't mean country specific ones like @yandex , @Baidu_Inc, #Naver or #Seznam

    But global like @Google , @bing , @brave search, @mojeek , @PriEcoSearch , #mwmbl
    #search #index #Naver #Seznam #mwmbl

  2. Stats page on the #OpenSource #SearchEngine #Mwmbl is finally fixed. And crawling should be working again.

    mwmbl.org/stats/

    Feeling a new level of motivation after attending the excellent #OSSym24 🎉

  3. It turns out that a #PostgresSQL table to store URLs crawled is a bad idea: the table is currently 749G on disk. In contrast, a bloom filter with a 1e-7 error for a billion items will take up about 4G hur.st/bloomfilter/?n=10000000

    #mwmbl #opensource #searchengine