home.social

#sitereliabilityengineering — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #sitereliabilityengineering, aggregated by home.social.

  1. The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE

    Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership

    atozofsoftwareengineering.blog

  2. The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE

    Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership

    atozofsoftwareengineering.blog

  3. The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE

    Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership

    atozofsoftwareengineering.blog

  4. The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE

    Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership

    atozofsoftwareengineering.blog

  5. The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE

    Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership

    atozofsoftwareengineering.blog

  6. 🚀 Do you know how to define release pipelines? Have experience in building environments from developer sandboxes to production? Then you're the right person to talk at DevConf.CZ 2025!

    Any tools and best practices in #DevOps and #Automation, agile development practices, continous application development or testing strategies are welcome!

    👉 Submit your proposal now at pretalx.devconf.info/devconf-c

    #Ansible, #AIOps, #CI/CD, #Tekton, #ArgoCD, #SiteReliabilityEngineering

  7. 🚀 Do you know how to define release pipelines? Have experience in building environments from developer sandboxes to production? Then you're the right person to talk at DevConf.CZ 2025!

    Any tools and best practices in and , agile development practices, continous application development or testing strategies are welcome!

    👉 Submit your proposal now at pretalx.devconf.info/devconf-c

    , , /CD, , ,

  8. #TechDebt isn't something you "clean up".
    It's something you inherit.

    Old budgets.
    Old decisions.
    Old survival strategies.
    (Patterns go brrr.)

    I rewrote my tech-debt essay and published it on #Substack.
    It's about why planning feels like necromancy,
    why teams repeat failure modes,
    and how language becomes infrastructure.

    If you've ever thought
    "this technically works, but something’s off":
    this is for you.

    👉 Tech Debt Isn't Bad Code—It's Encoded Legacy Patterns
    📎 systemicengineering.substack.c

    #SRE #SiteReliabilityEngineering #HumanSystems #SystemsThinking

  9. Reliability.
    Consistent results under load.

    #SiteReliabilityEngineering.
    ..

    Your team is a #DistributedSystem.
    Language is the transport layer.
    And truth is local.
    (Site.)

    #TechDebt slows down delivery.
    Decisions are unowned.
    And people burn out.
    (Reliability.)

    Divergent realities are a primary (in)variant of human systems.
    Linguistic precision counters entropy accruing ambiguity.
    And coherence is regulative.
    (Engineering.)

    ..

    Intrigued?
    I write about language, technology and #HumanSystems.
    👉 systemic.engineering/trauma-aw

    #SystemicEngineering #SocioTechSRE #SREforHumans #SRE

  10. Agents enter the room.
    Quick! What do you do?
    ..
    (Don't look at me.)

    Agents are non-embodied actors in human systems.
    Agents receive context as input.
    Agents decide, execute, loop.
    Agents reduce complexity.
    Until the END.

    Who pays the embodied cost of AI-driven sense-making?
    And why is it never the systems that scale it?

    I write about language, technology and human systems.
    👉 systemic.engineering/who-invit

    #SystemicEngineering #SRE #SREforHumans #SiteReliabilityEngineering #Agents #AI #AIEthics #AI

  11. #Fedihired

    I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.

    I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)

    Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect

    DM and I'd be happy to send my resume or apply at a role you suggest.

    Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. cnbc.com/2025/04/08/fake-job-s

    I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". news.ycombinator.com/item?id=4

  12. #Fedihired

    I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.

    I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)

    Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect

    DM and I'd be happy to send my resume or apply at a role you suggest.

    Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. cnbc.com/2025/04/08/fake-job-s

    I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". news.ycombinator.com/item?id=4

  13. #Fedihired

    I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.

    I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)

    Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect

    DM and I'd be happy to send my resume or apply at a role you suggest.

    Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. cnbc.com/2025/04/08/fake-job-s

    I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". news.ycombinator.com/item?id=4

  14. #Fedihired

    I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.

    I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)

    Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect

    DM and I'd be happy to send my resume or apply at a role you suggest.

    Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. cnbc.com/2025/04/08/fake-job-s

    I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". news.ycombinator.com/item?id=4

  15. #Fedihired

    I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.

    I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)

    Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect

    DM and I'd be happy to send my resume or apply at a role you suggest.

    Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. cnbc.com/2025/04/08/fake-job-s

    I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". news.ycombinator.com/item?id=4

  16. Received this copy of #OReilly #SiteReliabilityEngineering for correctly answering a question on a podcast. It is now available on Y2kChecklist.com and can be optionally signed by Wizards Anonymous.

  17. A big thank you to Skill Share Magazine for featuring our innovative solutions in their latest article, "Adaptive Load Balancing: Enhancing Performance in Dynamic Environments." 🌐

    Explore how our dynamic solutions are shaping the future of network optimization!

    relianoid.com/about-us/reliano

  18. Master Chaos Engineering to build resilient distributed systems. Explore hypothesis testing, blast radius control, and tools like AWS FIS vs. LitmusChaos. hackernoon.com/engineering-res #sitereliabilityengineering

  19. A year and change in and i made my first prod outage! Yay large DNS TTL's! It was my fault for not realizing it beforehand but .. sucks so much.

    #SRE #SiteReliabilityEngineering #sysadmin

  20. As #DevOps has evolved from “nice to have” to “must-have”, organizations need to evolve their practices using #SiteReliabilityEngineering & #PlatformEngineering.

    Getting the balance right is hard and necessary!

    Insights on #InfoQ: bit.ly/484Rw0t

    #SoftwareDevelopment

  21. do you wish numbers were more stressful? become a site reliability engineer! numbers will be so stressful and you can't explain why to 80% of the general public. :)

    become an #SRE today!

    #SiteReliabilityEngineer #SiteReliabilityEngineering #iwillbeokaybutwearecuttingitsuperclose

  22. #CaseStudy – find out how #Meta enhances its system reliability through advanced investigation tools. It introduces Hawkeye, an AI-assisted tool, which aids in debugging machine learning workflows.

    Learn more: bit.ly/4g5G2iO

    #InfoQ #DevOps #AI #SiteReliabilityEngineering

  23. 🌟 Excited to share our latest feature on Wotpost! 🌟

    Thank you, Wotpost, for highlighting RELIANOID’s commitment to empowering businesses through Site Reliability Engineering (). Our solutions are meticulously crafted to ensure seamless alignment with SRE principles, enabling organizations to establish and sustain robust IT infrastructure.

    relianoid.com/about-us/reliano

  24. If you've ever been on-call, you know that it can be stressful AF! Next week's guest on @geekingout_pod, Ashley Sawatsky of Rootly, talks about the importance of on-call health, and what you can do to prevent trauma and burnout. Episode drops on Feb 20th.

    Catch the YouTube premiere 👉 buff.ly/3SXSyah, or subscribe through your fave podcasting app!

    #oncall #siteReliabilityEngineering #incidentResponse #incidentManagement

  25. As #DevOps has evolved from “nice to have” to “must-have”, organizations need to evolve their practices using #SiteReliabilityEngineering & #PlatformEngineering.

    Getting the balance right is hard and necessary!

    Insights on #InfoQ: bit.ly/484Rw0t

    #SoftwareDevelopment

  26. What happens when you're an Observability vendor migrating to @opentelemetry? @jea knows exactly what that's like, as he shares the story of how he worked on migrating to OpenTelemetry at ServiceNow Cloud Observability (formerly Lightstep).

    📺: youtu.be/pHHINe9D94w

    #observability #openTelemetry #o11y #yttech #techvideos #sitereliabilityengineering #otip #otelInPractice #otelEndUserWorkingGroup

  27. I'm looking forward to speaking at EMEA, and discuss things to consider when adopting tools for your .
    May the open source be with you 😎
    usenix.org/conference/srecon23

  28. Happy Friday!! In case you missed yesterday’s webinar on What 2022 Taught Us About SRE’s Future, you can catch the recording here: ⬇️⬇️⬇️

    youtu.be/o0Pgo2fWIUc

    The panel includes @anamedina, @austinlparker, KC Tessarek, and Chad Beaudin, Mitch Ashley, and me.

    #SiteReliabilityEngineering #observability #reliability #ReliabilityEngineering

  29. Out this week - @anamedina and I wrote an article in The New Stack on Translating Failures into SLOs, based on our talk of the same name. Check it out! 👇

    thenewstack.io/translating-fai

    And you can always check out the talk version here: youtu.be/Mgzt4bq0JU4?si=KnJeNM

  30. If you've never done SLOs before, they can be scary! 🙀 But no need to panic, because @anamedina and I have got you covered! Check out our SLO primer article on @thenewstack, hot off the press! 🔥 

    thenewstack.io/demystifying-se

    #siteReliabilityEngineering #serviceLevelObjectives

  31. If you've never done SLOs before, they can be scary! 🙀 But no need to panic, because @anamedina and I have got you covered! Check out our SLO primer article on @thenewstack, hot off the press! 🔥 

    thenewstack.io/demystifying-se

    #siteReliabilityEngineering #serviceLevelObjectives

  32. If you've never done SLOs before, they can be scary! 🙀 But no need to panic, because @anamedina and I have got you covered! Check out our SLO primer article on @thenewstack, hot off the press! 🔥 

    thenewstack.io/demystifying-se

     

  33. If you've never done SLOs before, they can be scary! 🙀 But no need to panic, because @anamedina and I have got you covered! Check out our SLO primer article on @thenewstack, hot off the press! 🔥 

    thenewstack.io/demystifying-se

    #siteReliabilityEngineering #serviceLevelObjectives

  34. Netflix operates one of the most advanced multi-region active-active architectures on AWS, designed for global resilience, fault isolation, and continuous availability.

    This article explores key lessons in:
    • Distributed systems design
    • Eventual consistency
    • Region isolation
    • Cloud scalability strategies

    shorturl.at/H6PkW

    #AWS #DevOps #CloudArchitecture #DistributedSystems #SiteReliabilityEngineering #Microservices #Scalability #Tech