home.social

#postmortems — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #postmortems, aggregated by home.social.

  1. Later this month, we'll have the recording of our second episode of #OfficeOfTheITGuy. I am seeking a seasoned guest to talk about incidents and #postMortems. Extra credit if you have something to showcase on the show. #IT #DevOps #Podcast

  2. @thisismissem @sgf "blame" is not the same thing as "assigning responsibility".

    A good red flag for this is teams that say "We do #blameless #postmortems by not naming anyone in the postmortem".

    No! You know you have a blameless postmortem culture when you *can* name people in postmortems without it causing problems.

    This can be exceptionally hard to achieve, but it's worth it.

    Edit: see also @danslimmon blog.danslimmon.com/2023/04/20

  3. heard that Twitter is DDoSing itself (?)

    This is a good opportunity to announce I specialize in software perf & scalability. Reducing hosting costs. And parachuting in to solve hard bugs or otherwise "rescue" sites or projects farked up by a prior approach

    as a paid consultant

    #DDoS
    #Twitter
    #performance
    #scalability
    #scaling
    #tuning
    #CostReduction
    #ResourceMinimization
    #troubleshooting
    #rescues
    #rewrites
    #systems
    #RootCauseAnalysis
    #regressions
    #postmortems
    #architecture
    #efficiency
    #SRE

  4. heard that Twitter is DDoSing itself (?)

    This is a good opportunity to announce I specialize in software perf & scalability. Reducing hosting costs. And parachuting in to solve hard bugs or otherwise "rescue" sites or projects farked up by a prior approach

    as a paid consultant

    #DDoS
    #Twitter
    #performance
    #scalability
    #scaling
    #tuning
    #CostReduction
    #ResourceMinimization
    #troubleshooting
    #rescues
    #rewrites
    #systems
    #RootCauseAnalysis
    #regressions
    #postmortems
    #architecture
    #efficiency
    #SRE

  5. heard that Twitter is DDoSing itself (?)

    This is a good opportunity to announce I specialize in software perf & scalability. Reducing hosting costs. And parachuting in to solve hard bugs or otherwise "rescue" sites or projects farked up by a prior approach

    as a paid consultant

    #DDoS
    #Twitter
    #performance
    #scalability
    #scaling
    #tuning
    #CostReduction
    #ResourceMinimization
    #troubleshooting
    #rescues
    #rewrites
    #systems
    #RootCauseAnalysis
    #regressions
    #postmortems
    #architecture
    #efficiency
    #SRE

  6. heard that Twitter is DDoSing itself (?)

    This is a good opportunity to announce I specialize in software perf & scalability. Reducing hosting costs. And parachuting in to solve hard bugs or otherwise "rescue" sites or projects farked up by a prior approach

    as a paid consultant

    #DDoS
    #Twitter
    #performance
    #scalability
    #scaling
    #tuning
    #CostReduction
    #ResourceMinimization
    #troubleshooting
    #rescues
    #rewrites
    #systems
    #RootCauseAnalysis
    #regressions
    #postmortems
    #architecture
    #efficiency
    #SRE

  7. "Eventually this customer has had enough. They leave. This represents both a sizable blow to revenue and a scathing indictment of your product’s reliability at scale. But, on the bright side, both MTTR and MTBF benefit enormously! That’ll look great on the quarterly slide deck." (~700w)

    blog.danslimmon.com/2023/04/04 #sre #devops #incidentresponse #postmortems

  8. We do that anyway after incidents with #postmortems, but good time to reflect on procedures that we typically do.

    Can absolutely recommend this practice, it also is a great time for the team to share past #incident stories with each other...
    [4/6]

  9. @nova @hazelweakly As a seasoned developer/etc who's also had to do devops work, I deeply appreciate your postmortems. I love the transparency with the community.

    And SO well done! And I'm actually going to borrow some of the sections for our company's. 💯 ❤️

    #postmortems #devops #hachyderm

  10. pleased with this slide of mine from our monthly major incident meta-review, encouraging us towards #LearningFromIncidents and away from focusing on incident statistics

    the first half says: "The insights generated from reviewing incidents are primarily qualitative, because incidents are emergent behavior"

    the second half says "There is no relationship between the impact of an incident and the quality of insights generated through the review process"

    #Postmortems #SRE #IncidentResponse

  11. 2011, Los Alamos, at a for-profit nuclear lab:
    "Technicians settled on what seemed like a surefire way to win praise from their bosses: In a hi-tech testing and manufacturing building pivotal to sustaining America's nuclear arsenal, they gathered eight rods painstakingly crafted out of plutonium, and positioned them side-by-side on a table to photograph how nice they looked."

    science.org/content/article/ne

    via news.ycombinator.com/item?id=3

    #postmortem #postmortems
    #nuclearsafety

  12. #postmortems of failures in our technology, a whole forum to enjoy, recently updated with today's Cloudflare outage and a collection of other BGP mishaps:

    postmortems.info/

    #postmortem #bgp #cloudflare

    (boosts welcome!)

  13. Jonathan Hall of the Tiny DevOps Guy podcast interviewed me for episode 2! youtube.com/watch?v=-i-zRZ8nRa I discussed how #aviation can give us lessons for tech. Some of the topics included #postmortems, human factors impacting performance, accident chains (in both aviation and IT), safe attitudes, etc.

  14. If you like #postmortems - and this one's a beaut - head over to, explore, and sign up at
    postmortems.info

    (ref: cloudflare's #postmortem)

    @leyrer @cstrotm