home.social

#aicrawlers — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #aicrawlers, aggregated by home.social.

  1. C'est assez rigolo, les bots IA se calment le week-end, on passe de >90% en semaine à 70% "seulement" en fin de semaine...

    #gayfr #IA #AI #AIBots #AICrawlers

  2. 🇬🇧 If you like numbers, our dashboard lets you view all our sites’ statistics in real time.

    status.gayfr.social
    status.gayfr.online

    I’ve added new metrics related to Anubis, which protects us against AI bots. So at the bottom, you’ll see the number of bots blocked on sight (red), the number of visits that need to verify the “I’m not a robot” page (orange), the percentage that succeed (meaning they aren’t bots, in yellow), and finally the number of accepted human visits (green).

    What’s interesting is that the percentage of bots varies throughout the week but remains very high (between 65% and 96% depending on the time). That’s huge! They attack in swarms, just like a pain in the ass…

    That justifies the efforts made to protect against them.

    #gayfr #Anubis #IA #AI #ArtificialIntelligence #Bots #AIBots #AICrawlers

  3. 🇫🇷 Si vous aimez les chiffres, notre tableau de bord vous permet de voir toutes les statistiques de nos sites en temps réel.

    status.gayfr.social
    status.gayfr.online

    J'ai rajouté les nouveaux indicateurs relatifs à Anubis qui nous protège contre les bots IA. Ainsi à la fin vous verrez le nombre de bots bloqués à vue (rouge), le nombre de visites qui doivent valider la page "vous n'êtes pas un robot" (orange), le pourcentage qui réussissent (donc ne sont pas des bots, en jaune) et enfin le nombre de visites humaines acceptées (vert).

    Ce qui est intéressant c'est que le % de bots varie dans la semaine mais reste très important (entre 65% et 96% selon les moments). C'est énorme ! Ils attaquent en escadrille, comme les emmerdes...

    Ça justifie les efforts faits pour s'en protéger.

    #gayfr #Anubis #IA #AI #IntelligenceArtificielle #ArtificialIntelligence #Bots #AIBots #AICrawlers

  4. Nos statistiques des consultations hebdomadaires pour nos deux serveurs principaux.

    Tout vous paraît normal pour des serveurs francophones ?

    Cherchez l’IA...

    #IA #AI #AICrawlers #AIBots

  5. Helping protect journalists and local news from AI crawlers with Project Galileo – Cloudflare.com

     Helping protect journalists and local news from AI crawlers with Project Galileo

    2025-09-23, 5 min read

    By Patrick Day and Jocelyn Woolbright

    We are excited to announce that Project Galileo will now include access to Cloudflare’s Bot Management and AI Crawl Control services. Participants in the program, which include roughly 750 journalists, independent news organizations, and other non-profits supporting news-gathering around the world, will now have the ability to protect their websites from AI crawlers—for free. 

    Project Galileo is Cloudflare’s free program to help protect important civic voices online. Launched in 2014, it now includes more than 3,000 organizations in 125 countries, and it has served as the foundation for other free Cloudflare programs that help protect democratic elections, public schools, public health clinics, and other critical infrastructure.  

    Although we think all Project Galileo participants will benefit from these additional free services, we believe they are essential for news organizations. 

    News organizations, particularly local news, are facing significant challenges in transitioning to the AI-driven web. As people increasingly turn to AI models for information, less of their web traffic is making it to the actual website where that information originated. Industries, like news organizations, that rely on user traffic to generate revenue are increasingly at-risk. 

    Allowing news organizations to monitor and control how AI crawlers are interacting with their websites, will help them better protect their content and make more informed decisions about engaging with AI companies. Ultimately, our goal is to provide the tools news organizations need to negotiate fair compensation for their work.

    Editor’s Note: Read the rest of the story, at the below link.

    Continue/Read Original Article Here: Helping protect journalists and local news from AI crawlers with Project Galileo

    #2025 #AICrawlers #America #Cloudflare #CloudflareCom #Education #Health #History #Internet #Journalism #Journalists #Libraries #LibraryOfCongress #Opinion #Reading #Science #Technology #UnitedStates #WebTraffic #Writing

  6. ICYMI: Cloudflare launches pay per crawl to monetize AI content access: Cloudflare introduces pay per crawl service allowing content creators to charge AI crawlers for access. ppc.land/cloudflare-launches-p #Cloudflare #AIAccess #ContentMonetization #PayPerCrawl #AICrawlers

  7. ICYMI: Cloudflare launches pay per crawl to monetize AI content access: Cloudflare introduces pay per crawl service allowing content creators to charge AI crawlers for access. ppc.land/cloudflare-launches-p #Cloudflare #AIAccess #ContentMonetization #PayPerCrawl #AICrawlers

  8. In SquareSpace, you can opt to block AI crawlers in Settings. However it doesn't work since ChatGPT appears in my Analytics. Does anyone know if I could add in Website > Utilities > Website Tools > Code Injection this rule without creating any issues:
    User-agent : ChatGPT-User 
    Disallow: /

    #SquareSpace #AICrawlers #ChatGPT

  9. AI Crawlers stealing your content? Time to fight back! 💪

    LLMs and AI bots are scraping the web, stealing up your data, hogging bandwidth, and even crashing servers under aggressive loads.

    Don’t let them freeload! The CrowdSec AI Crawlers Blocklist stops unwanted harvesting before it hurts your site’s performance or privacy.

    Regain control over your digital assets: crowdsec.net/blog/protect-agai

    #AIcrawlers #blocklists #threatintelligence #cybersecurity #infosec #AIbots #dataprotection

  10. So according to the request statistics, since the last rotation of the access log file for the #MacPorts trac this morning, there were:

    20.8k requests from IE 3
    20.9k requests from IE 4
    21.3k requests from IE 5
    43 requests from IE 6 and
    23 requests from IE 7

    These requests came from these Windows versions (roughly 4k per version): CE, 95, 98 (9.5k), NT 4, 2000, XP, NT 5.01(?!), Server 2003, Vista, 7, and 8.0.

    I'm sure none of those are AI crawler bots.

    #aicrawler #aicrawlers

  11. Developers report aggressive AI crawlers overwhelming open-source infrastructure, with LibreNews citing up to 97% of traffic from AI bots on some projects. #AI #OpenSource #TechNews #AIcrawlers #Bots #LibreNews #DeveloperCommunity #Infrastructure #TechIndustry

  12. Protecting your blog from the dead eyed #AI crawlers. You can experiment with specific robots txt, and I also run a script in htaccess. I think there are metadata properties you can declare. None of this stops your pages being crawled but may afford some legal protection. (See the German Laion case recently). I'm doing a short blogpost on this, soon.

    #robotstxt #aicrawlers #htaccess