#sitereliabilityengineering — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #sitereliabilityengineering, aggregated by home.social.
-
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
Reliability is not an engineering goal. It is a leadership decision. #SRE #SiteReliabilityEngineering #Leadership #CIO #DigitalTransformation #Resilience #ITStrategy #EnterpriseIT #TechnologyLeadership #OperationalExcellence
https://stayingalive.in/cataloguing-strategic-innov/reliability-is-a-business.html -
🚀 Do you know how to define release pipelines? Have experience in building environments from developer sandboxes to production? Then you're the right person to talk at DevConf.CZ 2025!
Any tools and best practices in #DevOps and #Automation, agile development practices, continous application development or testing strategies are welcome!
👉 Submit your proposal now at https://pretalx.devconf.info/devconf-cz-2025/cfp
#Ansible, #AIOps, #CI/CD, #Tekton, #ArgoCD, #SiteReliabilityEngineering
-
🚀 Do you know how to define release pipelines? Have experience in building environments from developer sandboxes to production? Then you're the right person to talk at DevConf.CZ 2025!
Any tools and best practices in #DevOps and #Automation, agile development practices, continous application development or testing strategies are welcome!
👉 Submit your proposal now at https://pretalx.devconf.info/devconf-cz-2025/cfp
#Ansible, #AIOps, #CI/CD, #Tekton, #ArgoCD, #SiteReliabilityEngineering
-
Reliability is not an engineering goal. It is a leadership decision. #SRE #SiteReliabilityEngineering #Leadership #CIO #DigitalTransformation #Resilience #ITStrategy #EnterpriseIT #TechnologyLeadership #OperationalExcellence
https://stayingalive.in/cataloguing-strategic-innov/reliability-is-a-business.html -
#TechDebt isn't something you "clean up".
It's something you inherit.Old budgets.
Old decisions.
Old survival strategies.
(Patterns go brrr.)I rewrote my tech-debt essay and published it on #Substack.
It's about why planning feels like necromancy,
why teams repeat failure modes,
and how language becomes infrastructure.If you've ever thought
"this technically works, but something’s off":
this is for you.👉 Tech Debt Isn't Bad Code—It's Encoded Legacy Patterns
📎 https://systemicengineering.substack.com/p/tech-debt-and-encoded-legacy-patterns#SRE #SiteReliabilityEngineering #HumanSystems #SystemsThinking
-
Reliability.
Consistent results under load.#SiteReliabilityEngineering.
..Your team is a #DistributedSystem.
Language is the transport layer.
And truth is local.
(Site.)#TechDebt slows down delivery.
Decisions are unowned.
And people burn out.
(Reliability.)Divergent realities are a primary (in)variant of human systems.
Linguistic precision counters entropy accruing ambiguity.
And coherence is regulative.
(Engineering.)..
Intrigued?
I write about language, technology and #HumanSystems.
👉 https://systemic.engineering/trauma-awareness/ -
Agents enter the room.
Quick! What do you do?
..
(Don't look at me.)Agents are non-embodied actors in human systems.
Agents receive context as input.
Agents decide, execute, loop.
Agents reduce complexity.
Until the END.Who pays the embodied cost of AI-driven sense-making?
And why is it never the systems that scale it?I write about language, technology and human systems.
👉 https://systemic.engineering/who-invited-the-agent-oh-god-smith-will-suffice/#SystemicEngineering #SRE #SREforHumans #SiteReliabilityEngineering #Agents #AI #AIEthics #AI
-
⚠️Massive outage hits Australia's second-largest telecom provider, leaving millions stranded without mobile and internet services. Imagine that's happening to you! Let's explain and try to avoid it:
https://www.relianoid.com/blog/australian-network-failure-millions-of-users-affected/
#TelecomOutage #SiteReliability #RELIANOID #TelecomDisruption #NetworkOutage #TechDowntime #ServiceRestoration #SiteReliabilityEngineering #HighAvailability #TelecomResilience #TechFailures #NetworkReliability #Australia #Australiaattack #outage #vulnerabilities -
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
Received this copy of #OReilly #SiteReliabilityEngineering for correctly answering a question on a podcast. It is now available on https://Y2kChecklist.com and can be optionally signed by Wizards Anonymous.
-
A big thank you to Skill Share Magazine for featuring our innovative solutions in their latest article, "Adaptive Load Balancing: Enhancing Performance in Dynamic Environments." 🌐
Explore how our dynamic solutions are shaping the future of network optimization!
#Networking #AdaptiveLoadBalancing #SRE #NetworkPerformance #TechInnovation #Scalability #RELIANOID #SiteReliabilityEngineering #DigitalTransformation
https://www.relianoid.com/about-us/relianoid-related-articles/
-
Master Chaos Engineering to build resilient distributed systems. Explore hypothesis testing, blast radius control, and tools like AWS FIS vs. LitmusChaos. https://hackernoon.com/engineering-resilience-a-deep-dive-into-chaos-engineering-in-distributed-systems #sitereliabilityengineering
-
Learn how observability before migration reduces outages, sets clear SLOs, and makes enterprise modernizations predictable and safe. https://hackernoon.com/instrument-then-migrate-observability-lessons-from-mobile-monitoring-vans-to-fortune-100-apps #sitereliabilityengineering
-
A year and change in and i made my first prod outage! Yay large DNS TTL's! It was my fault for not realizing it beforehand but .. sucks so much.
-
As #DevOps has evolved from “nice to have” to “must-have”, organizations need to evolve their practices using #SiteReliabilityEngineering & #PlatformEngineering.
Getting the balance right is hard and necessary!
Insights on #InfoQ: https://bit.ly/484Rw0t
-
do you wish numbers were more stressful? become a site reliability engineer! numbers will be so stressful and you can't explain why to 80% of the general public. :)
become an #SRE today!
#SiteReliabilityEngineer #SiteReliabilityEngineering #iwillbeokaybutwearecuttingitsuperclose
-
#CaseStudy – find out how #Meta enhances its system reliability through advanced investigation tools. It introduces Hawkeye, an AI-assisted tool, which aids in debugging machine learning workflows.
Learn more: https://bit.ly/4g5G2iO
-
🌟 Excited to share our latest feature on Wotpost! 🌟
Thank you, Wotpost, for highlighting RELIANOID’s commitment to empowering businesses through Site Reliability Engineering (#SRE). Our solutions are meticulously crafted to ensure seamless alignment with SRE principles, enabling organizations to establish and sustain robust IT infrastructure.
#SiteReliabilityEngineering #ITInfrastructure #TechInnovation #RELIANOID #BusinessEmpowerment
https://www.relianoid.com/about-us/relianoid-related-articles/
-
If you've ever been on-call, you know that it can be stressful AF! Next week's guest on @geekingout_pod, Ashley Sawatsky of Rootly, talks about the importance of on-call health, and what you can do to prevent trauma and burnout. Episode drops on Feb 20th.
Catch the YouTube premiere 👉 https://buff.ly/3SXSyah, or subscribe through your fave podcasting app!
#oncall #siteReliabilityEngineering #incidentResponse #incidentManagement
-
As #DevOps has evolved from “nice to have” to “must-have”, organizations need to evolve their practices using #SiteReliabilityEngineering & #PlatformEngineering.
Getting the balance right is hard and necessary!
Insights on #InfoQ: https://bit.ly/484Rw0t
-
Out this week - @anamedina and I wrote an article in The New Stack on Translating Failures into SLOs, based on our #SLOConf talk of the same name. Check it out! 👇
https://thenewstack.io/translating-failures-into-service-level-objectives/
And you can always check out the talk version here: https://youtu.be/Mgzt4bq0JU4?si=KnJeNMF5OAGyd1Gy
-
What happens when you're an Observability vendor migrating to @opentelemetry? @jea knows exactly what that's like, as he shares the story of how he worked on migrating to OpenTelemetry at ServiceNow Cloud Observability (formerly Lightstep).
📺: https://youtu.be/pHHINe9D94w
#observability #openTelemetry #o11y #yttech #techvideos #sitereliabilityengineering #otip #otelInPractice #otelEndUserWorkingGroup
-
Issue 20 of SRE Newsletter will be published on 2nd of September, but if you’re curious, you can check out what it’ll cover
https://open.substack.com/pub/eutechdigest/p/issue-20-07a?utm_campaign=post&utm_medium=web
#sre #sitereliabilityengineering #devops #golang #go #tech #newsletter
-
I'm looking forward to speaking at #SREcon EMEA, and discuss things to consider when adopting #OpenSource tools for your #SiteReliabilityEngineering #SRE.
May the open source be with you 😎
https://www.usenix.org/conference/srecon23emea/presentation/horovits -
Blameless raises $30M to guide companies through their software lifecycle - Site reliability engineering platform Blameless announced Tuesday it raised $30 mi... - http://feedproxy.google.com/~r/Techcrunch/~3/EcprV_VLKAE/ #sitereliabilityengineering #lightspeedventurepartners #softwaredevelopment #softwareengineering #thirdpointventures #venturecapital #recentfunding #danmoskowitz #kurtanderson #vasnatarajan #enterprise #ravimhatre #developer #blameless #startups #lyonwong #sanmateo
-
I'll be speaking at DevOps Monthly Online on Tuesday April 4th, 2023. Details below. Hope to see y'all there!
https://www.meetup.com/devopsto/events/291457685/
#devOps #observability #o11y #reliability #siteReliabilityEngineering #womenInTech #latinasInTech
-
https://www.europesays.com/ie/162089/ You Are Asking the Wrong Questions (About Reliability and SRE) #Arts #ArtsAndDesign #ArtsAndDesign #ArtsDesign #Design #devops #Éire #Entertainment #IE #InfoQ #InfoQDevSummit #InfoQDevSummitBoston2025 #Ireland #QConSoftwareDevelopmentConference #SiteReliabilityEngineering #SreQuestions #Transcripts
-
Happy Friday!! In case you missed yesterday’s webinar on What 2022 Taught Us About SRE’s Future, you can catch the recording here: ⬇️⬇️⬇️
The panel includes @anamedina, @austinlparker, KC Tessarek, and Chad Beaudin, Mitch Ashley, and me.
#SiteReliabilityEngineering #observability #reliability #ReliabilityEngineering
-
@anamedina and I were so happy to speak with the always super insightful @divineops in this week's episode of @oncallmemaybe As one of the early champions of the #DevOps movement, her episode is LOADED with wisdom and takeaways.
#techGirls #womenInTech #sre #sitereliabilityengineering
https://oncallmemaybe.com/episodes/devopsify-with-sasha-rosenbaum-of-ergonautic
-
Reliability is not an engineering goal. It is a leadership decision. #SRE #SiteReliabilityEngineering #Leadership #CIO #DigitalTransformation #Resilience #ITStrategy #EnterpriseIT #TechnologyLeadership #OperationalExcellence
https://stayingalive.in/cataloguing-strategic-innov/reliability-is-a-business.html -
Out this week - @anamedina and I wrote an article in The New Stack on Translating Failures into SLOs, based on our #SLOConf talk of the same name. Check it out! 👇
https://thenewstack.io/translating-failures-into-service-level-objectives/
And you can always check out the talk version here: https://youtu.be/Mgzt4bq0JU4?si=KnJeNMF5OAGyd1Gy
-
Out this week - @anamedina and I wrote an article in The New Stack on Translating Failures into SLOs, based on our #SLOConf talk of the same name. Check it out! 👇
https://thenewstack.io/translating-failures-into-service-level-objectives/
And you can always check out the talk version here: https://youtu.be/Mgzt4bq0JU4?si=KnJeNMF5OAGyd1Gy
-
Out this week - @anamedina and I wrote an article in The New Stack on Translating Failures into SLOs, based on our #SLOConf talk of the same name. Check it out! 👇
https://thenewstack.io/translating-failures-into-service-level-objectives/
And you can always check out the talk version here: https://youtu.be/Mgzt4bq0JU4?si=KnJeNMF5OAGyd1Gy
-
Out this week - @anamedina and I wrote an article in The New Stack on Translating Failures into SLOs, based on our #SLOConf talk of the same name. Check it out! 👇
https://thenewstack.io/translating-failures-into-service-level-objectives/
And you can always check out the talk version here: https://youtu.be/Mgzt4bq0JU4?si=KnJeNMF5OAGyd1Gy
-
If you've never done SLOs before, they can be scary! 🙀 But no need to panic, because @anamedina and I have got you covered! Check out our SLO primer article on @thenewstack, hot off the press! 🔥
https://thenewstack.io/demystifying-service-level-objectives-for-you-and-me/
-
If you've never done SLOs before, they can be scary! 🙀 But no need to panic, because @anamedina and I have got you covered! Check out our SLO primer article on @thenewstack, hot off the press! 🔥
https://thenewstack.io/demystifying-service-level-objectives-for-you-and-me/
-
If you've never done SLOs before, they can be scary! 🙀 But no need to panic, because @anamedina and I have got you covered! Check out our SLO primer article on @thenewstack, hot off the press! 🔥
https://thenewstack.io/demystifying-service-level-objectives-for-you-and-me/
-
If you've never done SLOs before, they can be scary! 🙀 But no need to panic, because @anamedina and I have got you covered! Check out our SLO primer article on @thenewstack, hot off the press! 🔥
https://thenewstack.io/demystifying-service-level-objectives-for-you-and-me/
-
Happy Monday, y'all! I was asked to participate on a panel on SLOs for #SLOConf last week. Check out the recording! Panel moderated by Donnie Berkholz.
-
Check it out! @anamedina and I will be speaking at SLOConf this year!!
-
Netflix operates one of the most advanced multi-region active-active architectures on AWS, designed for global resilience, fault isolation, and continuous availability.
This article explores key lessons in:
• Distributed systems design
• Eventual consistency
• Region isolation
• Cloud scalability strategies#AWS #DevOps #CloudArchitecture #DistributedSystems #SiteReliabilityEngineering #Microservices #Scalability #Tech