home.social

#googlebot — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #googlebot, aggregated by home.social.

  1. Five non-obvious things I learned launching a film social network: from a robots.txt trap to the 24-dimensional mathematics of taste

    For the past six months I have been working on VibeMuvik, a film social network with reviews, debates, and synchronized movie watching. It is one of those things that seems "not that hard" until you start digging. This article is about unexpected discoveries. Not "how I chose my stack" (boring) and not "a WebRTC tutorial" (there are plenty without me). These are five situations where I stumbled, found something interesting, and thought "this is worth sharing; others will find it useful." Let's go.

    habr.com/ru/articles/1027876/

    #robotstxt #SEO #WebRTC #Nextjs #IndexNow #sitemap #Googlebot #Cinema_DNA #синхронный_просмотр #рекомендательные_системы

  2. ICYMI: Googlebot is not a program - Google engineers finally explain what it really is: Google engineers reveal Googlebot is a misnomer for a central SaaS crawling platform serving dozens of products, with a 15 MB default file size limit and geo-crawling constraints. ppc.land/googlebot-is-not-a-pr #Googlebot #SEO #Crawling #SaaS #DigitalMarketing

  3. Testing tool simulates Google's 2MB HTML limit as SEO professionals assess crawling impact: Dave Smart added a 2MB truncation feature to the Tame the Bots fetch tool on February 6, enabling technical SEO professionals to simulate Googlebot's reduced file size limits. ppc.land/testing-tool-simulate #SEO #GoogleBot #HTML #Crawling #DigitalMarketing

  4. @cks

    Early results are not promising. I've had a handful of HEAD requests in the past day. Only 2 appear legitimate, in that they hit genuine page URLs. The others were attempts to exploit WordPress vulnerabilities.

    #HTTP #httpd #GoogleBot #djbwares #WordPress

  5. @cks

    It makes me think that there's one well-behaved 'bot drowned in a sea of ill-behaved ones.

    I'm just instrumenting #djbwares httpd to log GET and HEAD differently. I wonder what I'll see.

    #HTTP #httpd #GoogleBot
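
The @cks posts above describe logging GET and HEAD requests separately and then checking which HEAD requests hit genuine pages. A minimal sketch of that kind of tally follows. It is not the djbwares httpd instrumentation itself: the log lines are made up, the Combined Log Format layout is an assumption (djbwares httpd may log differently), and the `WORDPRESS_MARKERS` path prefixes are a hypothetical heuristic for spotting WordPress probes.

```python
import re
from collections import Counter

# Hypothetical sample lines in Combined Log Format (an assumption, not real traffic).
SAMPLE_LOG = [
    '192.0.2.1 - - [01/Jan/2025:00:00:01 +0000] "HEAD /index.html HTTP/1.1" 200 0 "-" "ExampleBot/1.0"',
    '198.51.100.7 - - [01/Jan/2025:00:00:02 +0000] "HEAD /wp-login.php HTTP/1.1" 404 0 "-" "-"',
    '192.0.2.1 - - [01/Jan/2025:00:00:03 +0000] "GET /about.html HTTP/1.1" 200 512 "-" "ExampleBot/1.0"',
    '198.51.100.7 - - [01/Jan/2025:00:00:04 +0000] "HEAD /wp-admin/setup-config.php HTTP/1.1" 404 0 "-" "-"',
    '203.0.113.9 - - [01/Jan/2025:00:00:05 +0000] "GET /index.html HTTP/1.1" 200 1024 "-" "-"',
]

# Matches the quoted request line and the status code that follows it.
REQUEST_RE = re.compile(r'"(?P<method>[A-Z]+) (?P<path>\S+) [^"]*" (?P<status>\d{3})')

# Hypothetical path prefixes used to flag likely WordPress probes.
WORDPRESS_MARKERS = ("/wp-", "/xmlrpc.php")


def tally_methods(lines):
    """Count requests per HTTP method, skipping lines that do not parse."""
    counts = Counter()
    for line in lines:
        match = REQUEST_RE.search(line)
        if match:
            counts[match.group("method")] += 1
    return counts


def suspicious_heads(lines, markers=WORDPRESS_MARKERS):
    """Return paths of HEAD requests that start with a WordPress-looking prefix."""
    paths = []
    for line in lines:
        match = REQUEST_RE.search(line)
        if match and match.group("method") == "HEAD":
            if match.group("path").startswith(markers):
                paths.append(match.group("path"))
    return paths


if __name__ == "__main__":
    print(dict(tally_methods(SAMPLE_LOG)))
    print(suspicious_heads(SAMPLE_LOG))
```

On a real server the marker list would need tuning, and a HEAD request that avoids those prefixes is not thereby shown to come from a well-behaved crawler.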