home.social

#webstats — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #webstats, aggregated by home.social.

  1. Turns out a load of people in the UK set their alarm for 07:00 - closely followed by 07:17 then 08:11 (which is a good deal higher than 08:00).

    These are the RPS stats for the BBC service which (re)validates authentication sessions.

    You can also clearly see a spike around 09:30 when, presumably, people have looked through email and check the news. Then there's the spike for the 13:00 news/lunch break.

    We're all creatures of habit, it's the same *every* day.

    #BBC #WebStats #WebDev

  2. Turns out a load of people in the UK set their alarm for 07:00 - closely followed by 07:17 then 08:11 (which is a good deal higher than 08:00).

    These are the RPS stats for the BBC service which (re)validates authentication sessions.

    You can also clearly see a spike around 09:30 when, presumably, people have looked through email and check the news. Then there's the spike for the 13:00 news/lunch break.

    We're all creatures of habit, it's the same *every* day.

    #BBC #WebStats #WebDev

  3. 83% of requests (globally) to our apex homepages (bbc.co.uk/ and bbc.com/) are from "Python Requests".

    That's 84 million requests per day.

    In the words of Bricktop...

    #BBC #WebStats #PythonRequests #BrickTop

  4. We monitor how much traffic we get from each country per day to check errors/blocking etc. (as I've previously mentioned).

    Big drop-off recently from Angola & as usual it's an ISP (TV Cabo) changing their "is the internet working" config.

    That prompted me to (again) look at how much we're serving to Fortigate "is the internet working" checks per month, roughly:

    - 109 million web pages
    - 10 Terabytes of egress

    What a waste.

    Make `head` requests rather than `get`, please!

    #BBC #WebStats

  5. Wow, as reported elsewhere, traffic from Iran to www.bbc.co.uk & www.bbc.com has dropped to virtually nothing as of yesterday.

    Only 3 ASNs have made any requests at all today:
    - iran information technology company pjsc
    - netiface limited
    - m247 europe srl (unsure if this is *really* in Iran or not)

    We'd normally see about 200k requests/day from Iran and today we've had just over 600.

    #Iran #InternetShutdown #WebStats

  6. ~51,000 T-Mobile USA IP addresses make a total of 13 million requests to bbc.com/ using `python-requests`.

    Being on the internet is fun.

    #WebStats #WebDev

  7. 3% of requests to www.bbc.co.uk & www.bbc.com have a `user-agent` of OKHttp.

    Part of that 3% is a single IP in Turkey which is making nearly 3 million request per day using OKHttp on it's own.

    Being on the internet is fun.

    #WebStats #WebDev #BBC #Bots #WhoRuinedTheInternet

  8. Page views± on the Celebrity Traitors Live Page as a global heatmap.

    We had requests from 136 countries all-in.

    Didn't watch any of it myself but folks seemed to enjoy it - was definitely a big chunk of traffic.

    ± from "likely humans" - not obvious bots

    #WebStats #BBC #CelebrityTraitors

  9. Spot when the final of Traitors started.
    (this is a small subset of our logs)
    #BBC #iPlayer #WebStats

  10. Hmm chatgpt ser ud til at holde fobindelsen til min hjemmeside i live meeeeget længe. Til sammenligning er besøg fra copilot og perplexity normal <1sek.
    #webstats #AIscrapers #weird #danskertrut

  11. The BigQuery plugin for Grafana now (since v3) supports querying via the BigQuery Storage API.

    We updated our plugin yesterday so I've started enabling the Storage API (it's a toggle in the query editor).

    So far, I am seeing the Storage API being *significantly* faster, my estimate would be that it's at least 2-3x faster. Should also be cheaper, AFAICT.

    #BigQuery #Grafana #WebStats

    grafana.com/grafana/plugins/gr

    cloud.google.com/bigquery/docs

  12. Unsure if it's in other SQL ecosystems but I *love* the wildcard syntax BigQuery has - makes it (intentionally) super easy with our data taxonomy.

    If you have a load of tables inside a dataset, you can query across all of those tables with e.g.:

    ```
    select col1, col2 from `some_dataset.*` where ...
    ```
    We have a dataset per "service" (CDN, load balancer etc.) so if you want to query across everything on that services, it's nice and simple.

    #BigQuery #SQL #WebStats #WebDev #BBC

  13. Essex County Council makes nearly 1 million requests per day to the ancient and creaky news.bbc.co.uk (which doesn't even work over HTTPS).
    Edit: With an empty user agent

    What's the betting this is *yet again* "is the internet working" from some crufty old firewall?

    #WebStats #BBC

  14. I've been looking at the logs for our zone apexes today (bbc.co.uk & bbc.com - which are simply redirects to their www equivs).

    Good to see this is getting used for so many valid cases...oh, wait. FML

    #BBC #WebStats #WebDev

  15. I've been working on an automated triager for the frequent volumetric DDOS we see against www.bbc.com & www.bbc.co.uk.

    The idea is to use our edge access logs (stored in BigQuery) to isolate & describe the attack traffic then recommend any additional mitigations/filters etc. It also gives us a database of DDOS metrics/sources we can reference.

    Obviously I had to add the obligatory pew-pew map.

    #WebStats #InfosSec #DDOS #BBC

  16. I've been working on an automated triager for the frequent volumetric DDOS we see against www.bbc.com & www.bbc.co.uk.

    The idea is to use our edge access logs (stored in BigQuery) to isolate & describe the attack traffic then recommend any additional mitigations/filters etc. It also gives us a database of DDOS metrics/sources we can reference.

    Obviously I had to add the obligatory pew-pew map.

    #WebStats #InfosSec #DDOS #BBC

  17. I've been working on an automated triager for the frequent volumetric DDOS we see against www.bbc.com & www.bbc.co.uk.

    The idea is to use our edge access logs (stored in BigQuery) to isolate & describe the attack traffic then recommend any additional mitigations/filters etc. It also gives us a database of DDOS metrics/sources we can reference.

    Obviously I had to add the obligatory pew-pew map.

    #WebStats #InfosSec #DDOS #BBC

  18. I've been working on an automated triager for the frequent volumetric DDOS we see against www.bbc.com & www.bbc.co.uk.

    The idea is to use our edge access logs (stored in BigQuery) to isolate & describe the attack traffic then recommend any additional mitigations/filters etc. It also gives us a database of DDOS metrics/sources we can reference.

    Obviously I had to add the obligatory pew-pew map.

    #WebStats #InfosSec #DDOS #BBC

  19. I've been working on an automated triager for the frequent volumetric DDOS we see against www.bbc.com & www.bbc.co.uk.

    The idea is to use our edge access logs (stored in BigQuery) to isolate & describe the attack traffic then recommend any additional mitigations/filters etc. It also gives us a database of DDOS metrics/sources we can reference.

    Obviously I had to add the obligatory pew-pew map.

    #WebStats #InfosSec #DDOS #BBC

  20. The % of user agents which are known bots or have "bot" in their UA string yesterday (a normal Monday) were:

    www.bbc.co.uk: 3.0%
    www.bbc.com: 8.7%

    That's more than 33 million web pages in total.

    #WebStats #WebDev #AI #Bots

  21. What are people using to serve a static website created with hugo? My preference is for a docker image, and for the server to include some statistics on hits.

    #hugo #webserver #webstats

  22. We alert on total daily requests per country to www.bbc.co.uk & www.bbc.com. This is done is by comparing daily total requests per day by country (from GeoIP) versus a % of the 30 day median value.

    Recently, we've had a number of African countries which've fired alerts & after triaging, they all had a single-digit number of BBC World Service articles get super popular for a few days which skewed the median. No idea why. Any ideas?
    Example: bbc.com/storyworks/specials/re

    #BBC #WebStats

  23. Just raised the 1000th Pull Request on the logging pipeline I run (which ingests our CDN and client telemetry/error logs).

    Feels like quite a milestone and since the repo has been active for just over 3 years old, that's over 300 Pull Requests per year which averages out to more than one Pull Request every working day.

    #BBC #WebDev #WebStats

    cloud.google.com/blog/products

  24. Seems a lot of people are interested in the US election...here's our "last 30 days of total requests per ASN" for "other" (total for all ASNs outside the top 100).
    Requests yesterday were more than 2x the mean value over the previous 28 days.
    #BBC #USElection #WebStats

  25. Man Utd have sacked their manager, Erik Ten Haag.

    The "Breaking News" article doubled our request rate on www.bbc.co.uk.

    bbc.co.uk/sport/football/artic

    #WebStats #WebDev #BBC #Football

  26. We monitor traffic to www.bbc.co.uk & www.bbc.com per country & got alerts that daily requests from Angola have dropped off loads recently.

    Looking at the Angola traffic split by network AS. AS36907 traffic looks suspicious! Spidey sense triggered...the "before" traffic was *way* too consistent.

    Digging in to the logs, looks like they removed their Fortigates on 6th Sept. which'd been sending 343k req/day for www.bbc.co.uk/ , every single day!

    Gotta love being on the internet!

    #WebStats #BBC

  27. Earlier today we re-enabled http/2 on www.bbc.co.uk (& associated domains) on our UK traffic managers after temporarily disabling it whilst we made some changes to help prevent certain VPNs which return incorrect DNS answers to try to circumvent geographical rights restrictions.

    Before the change, h1 was ~90%, now it's ~20-25%.

    Looks like most clients using http/1 these days are scrapers/indexers/monitoring/junk.

    (times in UTC, our UK edge doesn't support h3 yet)

    #http #webDev #webStats #BBC

  28. Interesting.

    I get traffic reports from both Google and Bing on our website's performance. I also get raw traffic data from Cloudfare.

    Google keeps saying my traffic is down each month. Bing right now is reporting HIGHER numbers than Google. Google keeps pushing adsense on us as something we should "increase use of".

    I wonder if us opting out of Google's ad trackers, and adsense, in February this year with our site redesign is the cause. ;) (we still have the most basic analytics stuff, via Yoast SEO, running).

    Meanwhile, Cloudfare reports real numbers. We had 440K uniques in July. Google says we had 135K. (6 mo ago they said we had around 210K). Bing 144K. Our site logs? (after bots, etc weeded out): 442K.

    What's really interesting is that Bing reports we have higher traffic than Google does. This is the first month that's happened.

    DON'T TRUST GOOGLE's ANALYTICS.

    #google #webanalytics #googleanalytics #webstats #webtraffic

  29. Does anyone know what happened to internet access in Turkmenistan from the 11th to the 19th of July 2024?
    We saw roughly a 75% reduction in requests to www.bbc.co.uk & www.bbc.com over that period.
    Traffic looks to have returned now (possibly at a higher level).
    Curious...Could be a scraper or something I guess. Not a lot of traffic.
    #Turkmenistan #WebStats #Censorship (maybe)

  30. Just had a look at the daily total requests to www.bbc.co.uk & www.bbc.com from Bangladesh.
    When the shutdown was enacted, we saw a reduction from ~1.4M requests/day to ~5-7K requests/day. Roughly a 99.5% drop-off.
    Looks like the only remaining traffic is from the alt-net style (smaller) ISPs (and I'd assume government, military etc.).
    #Bangladesh #InternetShutdown #Censorship #WebStats

  31. #Canadian #media - well, the #legacy ones at least - are up in arms over the fact that #Meta / #Facebook is planning to block #news from appearing in #feeds in #Canada.

    You managed to get a #law passed that means they would have to #pay you for the privilege of #linking to you, didn't you?

    Now they get to choose whether to continue linking to you, or not.

    You can't have it both ways with "You must link to us, but you must pay us to link".

    I think they're in for a rude #webstats awakening.