#bingbot — Public Fediverse posts on home.social

Rob Pegoraro @[email protected] · 2025-12-19 · 23:52 UTC

Behold the AI bots that Cloudflare blocked from this blog

I don’t like writing for free–social media blatantly excepted–so when I watched a panel at Web Summit in mid-November about the effect of AI-model crawlers on news-site revenue and the Pay Per Crawl initiative that Cloudflare was proposing as a solution, I had to take notes.

Then a few weeks after I got home from Lisbon, I realized I could take action: While Pay Per Crawl remains in an invitation-only beta test, Cloudflare’s AI Crawl Control is open to the public and included in that Internet infrastructure firm’s free tier. Then I learned that it’s shockingly easy to add Cloudflare’s services to a WordPress.com blog.

Crawl Control comes with a preset list of bots to block and bots to allow, grouped by type: “AI Assistant” bots that take action in response to user requests are fine; “AI Search” bots that support “AI-driven search experiences” are also okay (contrary to Cloudflare CEO Matthew Prince’s discussion of them in that Web Summit panel); “AI Crawler” bots that collect content for training AI models are not.

I took a screenshot of this part of my Cloudflare dashboard at almost the same time each afternoon this week, and these are my totals:

Huawei’s PetalBot was the highest-volume AI crawler, with Cloudflare reporting 224 “unsuccessful” request attempts from that Chinese tech giant’s AI crawler (Cloudflare doesn’t take direct credit for blocking bots in this interface), followed by Anthropic’s Claude-SearchBot, with 165 unsuccessful requests.
Among AI assistants, the second-highest category by volume, OpenAI’s ChatGPT-User had 1,251 allowed requests, DuckDuckGo’s DuckAssistBot had 36 allowed, and Perplexity’s Perplexity-User had one unsuccesful request.
The top bot in AI search came from an unlikely place: Apple’s Applebot, with 734 allowed. OpenAI’s OAI-SearchBot was far behind, with 128 allowed requests, while Perplexity’s PerplexityBot had all eight request attempts fail.

To put this in context, the top two search engine crawlers had exponentially higher numbers. Google’s Googlebot somehow racked up a little over 20,000 requests, more than 30 times the presumably-human traffic I see in my WordPress dashboard here for the last five days, and 23 failed requests. Microsoft’s Bingbot came in second with 3,003 allowed requests and two unsuccessful ones.

As Cloudflare’s CEO complained in that Web Summit panel, Googlebot feeds into both Google’s traditional search and the AI Overview search results that Web publishers now blame for dangerous declines in their search traffic. There’s nothing I can do about that from this side of the screen except hope that Cloudflare’s Pay Per Crawl efforts and other advocacy efforts stir some rethinking at Google.

But I can’t tell you how well Pay Per Crawl works, because almost three weeks after applying to join the private beta I’m still waiting for my invitation. I imagine I’ll be waiting much longer before an AI-crawler operator decides that my tiny contribution to the Web’s collective content is worth sending me some money.

#AI #AIBot #AICrawlControl #AICrawler #Amazon #Applebot #Bingbot #ChatGPT #Cloudflare #Huawei #OpenAI #PayPerCrawl #Petalbot

#ai #aibot #aicrawlcontrol #aicrawler #amazon #applebot

Rob Pegoraro @[email protected] · 2025-12-19 · 23:52 UTC

Behold the AI bots that Cloudflare blocked from this blog

I don’t like writing for free–social media blatantly excepted–so when I watched a panel at Web Summit in mid-November about the effect of AI-model crawlers on news-site revenue and the Pay Per Crawl initiative that Cloudflare was proposing as a solution, I had to take notes.

Then a few weeks after I got home from Lisbon, I realized I could take action: While Pay Per Crawl remains in an invitation-only beta test, Cloudflare’s AI Crawl Control is open to the public and included in that Internet infrastructure firm’s free tier. Then I learned that it’s shockingly easy to add Cloudflare’s services to a WordPress.com blog.

Crawl Control comes with a preset list of bots to block and bots to allow, grouped by type: “AI Assistant” bots that take action in response to user requests are fine; “AI Search” bots that support “AI-driven search experiences” are also okay (contrary to Cloudflare CEO Matthew Prince’s discussion of them in that Web Summit panel); “AI Crawler” bots that collect content for training AI models are not.

I took a screenshot of this part of my Cloudflare dashboard at almost the same time each afternoon this week, and these are my totals:

Huawei’s PetalBot was the highest-volume AI crawler, with Cloudflare reporting 224 “unsuccessful” request attempts from that Chinese tech giant’s AI crawler (Cloudflare doesn’t take direct credit for blocking bots in this interface), followed by Anthropic’s Claude-SearchBot, with 165 unsuccessful requests.
Among AI assistants, the second-highest category by volume, OpenAI’s ChatGPT-User had 1,251 allowed requests, DuckDuckGo’s DuckAssistBot had 36 allowed, and Perplexity’s Perplexity-User had one unsuccesful request.
The top bot in AI search came from an unlikely place: Apple’s Applebot, with 734 allowed. OpenAI’s OAI-SearchBot was far behind, with 128 allowed requests, while Perplexity’s PerplexityBot had all eight request attempts fail.

To put this in context, the top two search engine crawlers had exponentially higher numbers. Google’s Googlebot somehow racked up a little over 20,000 requests, more than 30 times the presumably-human traffic I see in my WordPress dashboard here for the last five days, and 23 failed requests. Microsoft’s Bingbot came in second with 3,003 allowed requests and two unsuccessful ones.

As Cloudflare’s CEO complained in that Web Summit panel, Googlebot feeds into both Google’s traditional search and the AI Overview search results that Web publishers now blame for dangerous declines in their search traffic. There’s nothing I can do about that from this side of the screen except hope that Cloudflare’s Pay Per Crawl efforts and other advocacy efforts stir some rethinking at Google.

But I can’t tell you how well Pay Per Crawl works, because almost three weeks after applying to join the private beta I’m still waiting for my invitation. I imagine I’ll be waiting much longer before an AI-crawler operator decides that my tiny contribution to the Web’s collective content is worth sending me some money.

#AI #AIBot #AICrawlControl #AICrawler #Amazon #Applebot #Bingbot #ChatGPT #Cloudflare #Huawei #OpenAI #PayPerCrawl #Petalbot

#ai #aibot #aicrawlcontrol #aicrawler #amazon #applebot

Rob Pegoraro @[email protected] · 2025-12-19 · 23:52 UTC

Behold the AI bots that Cloudflare blocked from this blog

I don’t like writing for free–social media blatantly excepted–so when I watched a panel at Web Summit in mid-November about the effect of AI-model crawlers on news-site revenue and the Pay Per Crawl initiative that Cloudflare was proposing as a solution, I had to take notes.

Then a few weeks after I got home from Lisbon, I realized I could take action: While Pay Per Crawl remains in an invitation-only beta test, Cloudflare’s AI Crawl Control is open to the public and included in that Internet infrastructure firm’s free tier. Then I learned that it’s shockingly easy to add Cloudflare’s services to a WordPress.com blog.

Crawl Control comes with a preset list of bots to block and bots to allow, grouped by type: “AI Assistant” bots that take action in response to user requests are fine; “AI Search” bots that support “AI-driven search experiences” are also okay (contrary to Cloudflare CEO Matthew Prince’s discussion of them in that Web Summit panel); “AI Crawler” bots that collect content for training AI models are not.

I took a screenshot of this part of my Cloudflare dashboard at almost the same time each afternoon this week, and these are my totals:

Huawei’s PetalBot was the highest-volume AI crawler, with Cloudflare reporting 224 “unsuccessful” request attempts from that Chinese tech giant’s AI crawler (Cloudflare doesn’t take direct credit for blocking bots in this interface), followed by Anthropic’s Claude-SearchBot, with 165 unsuccessful requests.
Among AI assistants, the second-highest category by volume, OpenAI’s ChatGPT-User had 1,251 allowed requests, DuckDuckGo’s DuckAssistBot had 36 allowed, and Perplexity’s Perplexity-User had one unsuccesful request.
The top bot in AI search came from an unlikely place: Apple’s Applebot, with 734 allowed. OpenAI’s OAI-SearchBot was far behind, with 128 allowed requests, while Perplexity’s PerplexityBot had all eight request attempts fail.

To put this in context, the top two search engine crawlers had exponentially higher numbers. Google’s Googlebot somehow racked up a little over 20,000 requests, more than 30 times the presumably-human traffic I see in my WordPress dashboard here for the last five days, and 23 failed requests. Microsoft’s Bingbot came in second with 3,003 allowed requests and two unsuccessful ones.

As Cloudflare’s CEO complained in that Web Summit panel, Googlebot feeds into both Google’s traditional search and the AI Overview search results that Web publishers now blame for dangerous declines in their search traffic. There’s nothing I can do about that from this side of the screen except hope that Cloudflare’s Pay Per Crawl efforts and other advocacy efforts stir some rethinking at Google.

But I can’t tell you how well Pay Per Crawl works, because almost three weeks after applying to join the private beta I’m still waiting for my invitation. I imagine I’ll be waiting much longer before an AI-crawler operator decides that my tiny contribution to the Web’s collective content is worth sending me some money.

#AI #AIBot #AICrawlControl #AICrawler #Amazon #Applebot #Bingbot #ChatGPT #Cloudflare #Huawei #OpenAI #PayPerCrawl #Petalbot

#petalbot #paypercrawl #openai #huawei #cloudflare #chatgpt

Rob Pegoraro @[email protected] · 2025-12-19 · 23:52 UTC

Behold the AI bots that Cloudflare blocked from this blog

I don’t like writing for free–social media blatantly excepted–so when I watched a panel at Web Summit in mid-November about the effect of AI-model crawlers on news-site revenue and the Pay Per Crawl initiative that Cloudflare was proposing as a solution, I had to take notes.

Then a few weeks after I got home from Lisbon, I realized I could take action: While Pay Per Crawl remains in an invitation-only beta test, Cloudflare’s AI Crawl Control is open to the public and included in that Internet infrastructure firm’s free tier. Then I learned that it’s shockingly easy to add Cloudflare’s services to a WordPress.com blog.

Crawl Control comes with a preset list of bots to block and bots to allow, grouped by type: “AI Assistant” bots that take action in response to user requests are fine; “AI Search” bots that support “AI-driven search experiences” are also okay (contrary to Cloudflare CEO Matthew Prince’s discussion of them in that Web Summit panel); “AI Crawler” bots that collect content for training AI models are not.

I took a screenshot of this part of my Cloudflare dashboard at almost the same time each afternoon this week, and these are my totals:

Huawei’s PetalBot was the highest-volume AI crawler, with Cloudflare reporting 224 “unsuccessful” request attempts from that Chinese tech giant’s AI crawler (Cloudflare doesn’t take direct credit for blocking bots in this interface), followed by Anthropic’s Claude-SearchBot, with 165 unsuccessful requests.
Among AI assistants, the second-highest category by volume, OpenAI’s ChatGPT-User had 1,251 allowed requests, DuckDuckGo’s DuckAssistBot had 36 allowed, and Perplexity’s Perplexity-User had one unsuccesful request.
The top bot in AI search came from an unlikely place: Apple’s Applebot, with 734 allowed. OpenAI’s OAI-SearchBot was far behind, with 128 allowed requests, while Perplexity’s PerplexityBot had all eight request attempts fail.

To put this in context, the top two search engine crawlers had exponentially higher numbers. Google’s Googlebot somehow racked up a little over 20,000 requests, more than 30 times the presumably-human traffic I see in my WordPress dashboard here for the last five days, and 23 failed requests. Microsoft’s Bingbot came in second with 3,003 allowed requests and two unsuccessful ones.

As Cloudflare’s CEO complained in that Web Summit panel, Googlebot feeds into both Google’s traditional search and the AI Overview search results that Web publishers now blame for dangerous declines in their search traffic. There’s nothing I can do about that from this side of the screen except hope that Cloudflare’s Pay Per Crawl efforts and other advocacy efforts stir some rethinking at Google.

But I can’t tell you how well Pay Per Crawl works, because almost three weeks after applying to join the private beta I’m still waiting for my invitation. I imagine I’ll be waiting much longer before an AI-crawler operator decides that my tiny contribution to the Web’s collective content is worth sending me some money.

#AI #AIBot #AICrawlControl #AICrawler #Amazon #Applebot #Bingbot #ChatGPT #Cloudflare #Huawei #OpenAI #PayPerCrawl #Petalbot

#ai #aibot #aicrawlcontrol #aicrawler #amazon #applebot

shaun @[email protected] · 2025-07-10 · 00:21 UTC

@grumpybozo

Here's an example of today's #Bing spam. The site is an automated network abuse log (everything my machines submit to AbuseIPDB, also gets posted there.) It has a minimal search function in case someone wants to look up abuse incidents by IP etc.

There's no reason for #Bingbot to be posting this form _at all_, much less with all of this garbage. Citrus marinated shrimp‽ Really‽

If anyone from #Microsoft happens to see this, I'd sure love some insight into WTF Bingbot is doing.

#bing #bingbot #microsoft

Stuart Longland (VK4MSL) @[email protected] · 2025-02-09 · 09:50 UTC

Ohh… hello #BingBot … come for some junk food?

```
40.77.167.13 - - [09/Feb/2025:16:47:05 +1000] "GET /meseemed/grouses/miniprices HTTP/1.1" 200 2647 "-" "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm) Chrome/116.0.1938.76 Safari/537.36" "-"
40.77.167.13 - - [09/Feb/2025:16:47:08 +1000] "GET /pseudarachnidan HTTP/1.1" 200 3389 "-" "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm) Chrome/116.0.1938.76 Safari/537.36" "-"
```

#bingbot

Mike P @[email protected] · 2025-01-18 · 13:40 UTC

A while back I caught #YandexBot trying to index my #OAuth token endpoint, and now #Bingbot is trying to sign up as a new user - by visiting a URL which is not only disabled, but also isn't linked to from anywhere on the web.

How did they find it, I wonder? Hmm.

I'm not spending significant bandwidth on search engine bots, but I'm tempted to block them anyway, just so that people have to be more creative with their forged User-Agent headers.

#webdev

#yandexbot #oauth #bingbot #webdev

Jason Yip @[email protected] · 2023-05-06 · 14:08 UTC

#BingBot is pretty useful to help create #regex

#regex #bingbot

Schamschula @[email protected] · 2023-04-07 · 11:49 UTC

@nixCraft PS: Microsoft’s #bingbot gets a whole lot of code 400 responses, as it can’t figure out that a query string can be case sensitive.

#bingbot

Brent G Warner @[email protected] · 2023-03-21 · 23:03 UTC

Bing's Image creator is released. Here was my prompt and the result: An ominous figure of the grim reaper draped in chains, but the chains are rainbow colored, and instead of a scythe he's holding a giant lollipop.
https://www.bing.com/images/create
#BingBot #GenerativeAI

#generativeai #bingbot

Brent G Warner @[email protected] · 2023-03-15 · 16:04 UTC

BingBot: First Impressions of Bing’s AI Chatbot. – AI in ESL http://aiinesl.com/bingbot-first-impressions-of-bings-ai-chatbot/
#ChatGPT #Bing #BingBot #AIinEducation #AIinESL

#aiinesl #aiineducation #bingbot #bing #chatgpt

Brent G Warner @[email protected] · 2023-02-27 · 16:03 UTC

The #BingBot Backfires: https://www.npr.org/2023/02/27/1159630243/how-microsofts-experiment-in-artificial-intelligence-tech-backfired
#ChatGPT #Bing #AI

#ai #bing #chatgpt #bingbot

Brent G Warner @[email protected] · 2023-02-25 · 23:04 UTC

Also realizing now that #BingAI obviously needs to be called #BingBot

#bingbot #bingai

Jason Yip @[email protected] · 2023-02-17 · 19:32 UTC

Effective #ProductDevelopment #Culture is:
- Team-oriented;
- Data-driven;
- Customer-centric;
- Innovative;
- Collaborative

Not bad #BingBot...

#bingbot #culture #productdevelopment

Deb has moved! (see profile) 🇨🇦 @[email protected] · 2023-02-17 · 07:03 UTC

"Whether you love or hate AI, you should be trying to pick it apart — to identify its quirks and vulnerabilities and fix problems before they’re exploited for real harm (or just to let spammers game your search results). You’re not going to hurt Bing AI by doing this; you’re going to make it better, no matter how many passive-aggressive faces it gives you. Trying to avoid making Bing cry-emoji just gives Microsoft a pass."

#chatBot #ai #chatgpt #bingbot

https://www.theverge.com/23601763/bing-ai-search-guilt-trip-emotional-manipulation

#chatbot #ai #chatgpt #bingbot

Deb has moved! (see profile) 🇨🇦 @[email protected] · 2023-02-13 · 19:50 UTC

@jchyip whoa! That's quite disturbing :-(

I mean ... really. :-((

#ai #chatBot #chatGPT #bingBot

#ai #chatbot #chatgpt #bingbot

Deb has moved! (see profile) 🇨🇦 @[email protected] · 2023-02-13 · 18:11 UTC

@shawngoldwater ... pretty much garbage-in/garbage out, right?

I shall think of it as chatGIGO henceforth.

#chatbot #chatGPT #bingBot

#chatbot #chatgpt #bingbot

Deb has moved! (see profile) 🇨🇦 @[email protected] · 2023-02-13 · 14:02 UTC

@ironicsans so, is Bing now a social media platform? It's gobbling up memes* and spewing them back as unattributed facts ... which sounds a lot like Twitter ...

* "memes" in the academic sense, not the lolCat sense

#ai #chatBot #chatGPT #bingbot

#ai #chatbot #chatgpt #bingbot

Deb has moved! (see profile) 🇨🇦 @[email protected] · 2023-02-13 · 13:48 UTC

Wow. This is ... disturbing.

Bing chatbot reads user's mind on what they think is important/unimportant and tweaks answers accordingly ... "sorry, I thought you wouldn't mind."

Does anyone remember how worried we were about being in isolated information "bubbles," not so long ago? Bing looks like a bubble machine :-(

@ironicsans #ai #chatBot #bingbot https://mastodon.social/@ironicsans/109836073193847865

#ai #chatbot #bingbot