#aicrawlers — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #aicrawlers, aggregated by home.social.
-
It's rather funny: the AI bots calm down at the weekend, and we go from >90% during the week to "only" 70% at the end of the week...
-
🇬🇧 If you like numbers, our dashboard lets you view all our sites’ statistics in real time.
https://status.gayfr.social
https://status.gayfr.online
I’ve added new metrics related to Anubis, which protects us against AI bots. So at the bottom, you’ll see the number of bots blocked on sight (red), the number of visits that need to verify the “I’m not a robot” page (orange), the percentage that succeed (meaning they aren’t bots, in yellow), and finally the number of accepted human visits (green).
What’s interesting is that the percentage of bots varies throughout the week but remains very high (between 65% and 96% depending on the time). That’s huge! They come in squadrons, just like trouble always does…
That justifies the efforts made to protect against them.
#gayfr #Anubis #IA #AI #ArtificialIntelligence #Bots #AIBots #AICrawlers
-
🇫🇷 If you like numbers, our dashboard lets you see all of our sites' statistics in real time.
https://status.gayfr.social
https://status.gayfr.online
I have added the new indicators for Anubis, which protects us against AI bots. At the end you will see the number of bots blocked on sight (red), the number of visits that have to pass the "you are not a robot" page (orange), the percentage that succeed (so are not bots, in yellow), and finally the number of accepted human visits (green).
What is interesting is that the percentage of bots varies over the week but stays very high (between 65% and 96% depending on the moment). That is huge! They attack in squadrons, like trouble does...
That justifies the effort put into protecting against them.
#gayfr #Anubis #IA #AI #IntelligenceArtificielle #ArtificialIntelligence #Bots #AIBots #AICrawlers
-
Our weekly visit statistics for our two main servers.
Does everything look normal to you for French-language servers?
Look for the AI...
-
https://winbuzzer.com/2026/02/09/cloudflare-google-search-monopoly-ai-data-advantage-xcxwbn/
Cloudflare: Google Abuses Search Monopoly for 4.8x AI Data Advantage
#AI #Google #Cloudflare #BigTech #Search #AITrainingData #AICrawlers #AITraining #Content #Publishers #SearchResults #SearchEngines
-
Cloudflare launches Content Signals Policy to fight AI crawlers and scrapers
https://web.brid.gy/r/https://nerds.xyz/2025/09/cloudflare-content-signals-policy-ai-crawlers/
-
Helping protect journalists and local news from AI crawlers with Project Galileo – Cloudflare.com
2025-09-23, 5 min read
By Patrick Day and Jocelyn Woolbright
We are excited to announce that Project Galileo will now include access to Cloudflare’s Bot Management and AI Crawl Control services. Participants in the program, which include roughly 750 journalists, independent news organizations, and other non-profits supporting news-gathering around the world, will now have the ability to protect their websites from AI crawlers—for free.
Project Galileo is Cloudflare’s free program to help protect important civic voices online. Launched in 2014, it now includes more than 3,000 organizations in 125 countries, and it has served as the foundation for other free Cloudflare programs that help protect democratic elections, public schools, public health clinics, and other critical infrastructure.
Although we think all Project Galileo participants will benefit from these additional free services, we believe they are essential for news organizations.
News organizations, particularly local news, are facing significant challenges in transitioning to the AI-driven web. As people increasingly turn to AI models for information, less of their web traffic is making it to the actual website where that information originated. Industries that rely on user traffic to generate revenue, such as news organizations, are increasingly at risk.
Allowing news organizations to monitor and control how AI crawlers are interacting with their websites will help them better protect their content and make more informed decisions about engaging with AI companies. Ultimately, our goal is to provide the tools news organizations need to negotiate fair compensation for their work.
Editor’s Note: Read the rest of the story, at the below link.
Continue/Read Original Article Here: Helping protect journalists and local news from AI crawlers with Project Galileo
#2025 #AICrawlers #America #Cloudflare #CloudflareCom #Education #Health #History #Internet #Journalism #Journalists #Libraries #LibraryOfCongress #Opinion #Reading #Science #Technology #UnitedStates #WebTraffic #Writing
-
The metadata for RSL
#rsl #rslstandard #ai #aicrawlers #AICrawlControl
RSL Specification | RSL: Really Simple Licensing
https://rslstandard.org/rsl
-
ICYMI: Cloudflare launches pay per crawl to monetize AI content access: Cloudflare introduces pay per crawl service allowing content creators to charge AI crawlers for access. https://ppc.land/cloudflare-launches-pay-per-crawl-to-monetize-ai-content-access/ #Cloudflare #AIAccess #ContentMonetization #PayPerCrawl #AICrawlers
-
3dclive uses Cloudflare to block AI crawlers stealing YOUR content https://3dcandy.live/2025/07/3dclive-uses-cloudflare-to-block-ai-crawlers-stealing-your-content/ #3dcandy #3dclive #ai #aicrawlers #boost #cloudflare #content #stealing
-
Google Analyst Warns: AI Bots Risk Internet Gridlock By Server Overload
#AI #AICrawlers #InternetCongestion #WebPerformance #Google #AIethics #CyberSecurity #FutureOfWeb #AISafety #DataPrivacy
-
In Squarespace, you can opt to block AI crawlers in Settings. However, it doesn't seem to work, since ChatGPT still appears in my Analytics. Does anyone know if I could add this rule under Website > Utilities > Website Tools > Code Injection without creating any issues:
User-agent: ChatGPT-User
Disallow: /
-
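For the Squarespace question above, here is a minimal sketch of how the ChatGPT-User rule could be checked locally with Python's standard-library robots.txt parser. It only shows what the rule means to a parser that honors it. It does not show where Squarespace lets you place it, since robots.txt rules are read from the /robots.txt file at the site root, and it cannot force a crawler to comply. The URL and the second agent name are placeholders, not taken from the post.

```python
from urllib.robotparser import RobotFileParser

# The rule from the post, written without the space before the colon.
RULES = """\
User-agent: ChatGPT-User
Disallow: /
"""


def is_allowed(rules_text, agent_token, url):
    """Return True if the given robots.txt text lets agent_token fetch url."""
    parser = RobotFileParser()
    parser.parse(rules_text.splitlines())
    return parser.can_fetch(agent_token, url)


# Placeholder URL, used only for illustration.
PAGE = "https://example.com/some-page"

blocked_agent_allowed = is_allowed(RULES, "ChatGPT-User", PAGE)
other_agent_allowed = is_allowed(RULES, "SomeOtherBot", PAGE)

print(blocked_agent_allowed)  # expected: False
print(other_agent_allowed)    # expected: True
```

The rule only names one agent, so every other crawler stays allowed unless it gets its own entry.
-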
AI Crawlers stealing your content? Time to fight back! 💪
LLMs and AI bots are scraping the web, taking your data, hogging bandwidth, and even crashing servers under aggressive loads.
Don’t let them freeload! The CrowdSec AI Crawlers Blocklist stops unwanted harvesting before it hurts your site’s performance or privacy.
Regain control over your digital assets: https://crowdsec.net/blog/protect-against-ai-crawlers
#AIcrawlers #blocklists #threatintelligence #cybersecurity #infosec #AIbots #dataprotection
-
So according to the request statistics, since the last rotation of the access log file for the #MacPorts trac this morning, there were:
20.8k requests from IE 3
20.9k requests from IE 4
21.3k requests from IE 5
43 requests from IE 6 and
23 requests from IE 7
These requests came from these Windows versions (roughly 4k per version): CE, 95, 98 (9.5k), NT 4, 2000, XP, NT 5.01(?!), Server 2003, Vista, 7, and 8.0.
I'm sure none of those are AI crawler bots.
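A tally like the one above can be sketched in a few lines of Python. This is only an illustration of the idea, not the script used for the MacPorts trac: the sample User-Agent strings are invented, and the regular expression assumes the classic "MSIE <version>" token that old Internet Explorer releases sent.

```python
import re
from collections import Counter

# Classic IE user agents carry a token such as "MSIE 5.01".
MSIE_RE = re.compile(r"MSIE (\d+)\.")


def count_ie_versions(user_agents):
    """Count requests per claimed IE major version."""
    counts = Counter()
    for ua in user_agents:
        match = MSIE_RE.search(ua)
        if match:
            counts[f"IE {match.group(1)}"] += 1
    return counts


# Invented sample data, for illustration only.
sample = [
    "Mozilla/2.0 (compatible; MSIE 3.02; Windows 95)",
    "Mozilla/4.0 (compatible; MSIE 4.01; Windows 98)",
    "Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0)",
    "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)",
    "Mozilla/5.0 (X11; Linux x86_64; rv:128.0) Gecko/20100101 Firefox/128.0",
]

counts = count_ie_versions(sample)
for version, total in sorted(counts.items()):
    print(version, total)
```

A near-uniform spread across long-dead browser and Windows versions, as in the figures above, is the kind of pattern such a count makes visible.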
-
Developers report aggressive AI crawlers overwhelming open-source infrastructure, with LibreNews citing up to 97% of traffic from AI bots on some projects. #AI #OpenSource #TechNews #AIcrawlers #Bots #LibreNews #DeveloperCommunity #Infrastructure #TechIndustry
-
Devs say AI crawlers dominate traffic, forcing blocks on entire countries
#HackerNews #AItraffic #AIcrawlers #DevsNews #InternetRegulation #CountryBlocks
-
Protecting your blog from the dead-eyed #AI crawlers. You can experiment with specific robots.txt rules, and I also run a script in .htaccess. I think there are metadata properties you can declare. None of this stops your pages being crawled, but it may afford some legal protection. (See the recent German LAION case.) I'm writing a short blog post on this soon.
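As a rough sketch of the robots.txt part, the snippet below builds a file that asks a few named crawlers to stay away while leaving everyone else allowed. The token list is only an example (GPTBot and ChatGPT-User are published by OpenAI, CCBot by Common Crawl). Check each operator's documentation for current names. As the post says, this is a request rather than enforcement.

```python
# Example crawler tokens, chosen for illustration; extend or replace as needed.
AI_CRAWLER_TOKENS = ["GPTBot", "ChatGPT-User", "CCBot"]


def build_robots_txt(tokens):
    """Return robots.txt text that disallows each token and allows the rest."""
    blocks = [f"User-agent: {token}\nDisallow: /\n" for token in tokens]
    # Default group: all other agents remain allowed.
    blocks.append("User-agent: *\nAllow: /\n")
    return "\n".join(blocks)


robots_txt = build_robots_txt(AI_CRAWLER_TOKENS)
print(robots_txt)
```

The resulting text would be saved as robots.txt at the root of the site.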