#sitereliabilityengineering — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #sitereliabilityengineering, aggregated by home.social.
-
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
The Engineering Leadership Crisis Nobody Talks About 🚨 #EngineeringLeadership #SoftwareEngineering #PlatformEngineering #TechLeadership #Microservices #SRE
Modern engineering teams are collapsing under platform complexity, AI chaos, organizational scaling failures, and unreliable architectures. This deep technical leadership guide explains how elite engineering leaders manage platform rewrites, reliability crises, organizational chaos, and large-scale modernization without destroying delivery velocity. #SoftwareArchitecture #EngineeringManagement #DevOps #CloudComputing #Leadership -
Reliability is not an engineering goal. It is a leadership decision. #SRE #SiteReliabilityEngineering #Leadership #CIO #DigitalTransformation #Resilience #ITStrategy #EnterpriseIT #TechnologyLeadership #OperationalExcellence
https://stayingalive.in/cataloguing-strategic-innov/reliability-is-a-business.html -
Reliability is not an engineering goal. It is a leadership decision. #SRE #SiteReliabilityEngineering #Leadership #CIO #DigitalTransformation #Resilience #ITStrategy #EnterpriseIT #TechnologyLeadership #OperationalExcellence
https://stayingalive.in/cataloguing-strategic-innov/reliability-is-a-business.html -
Reliability is not an engineering goal. It is a leadership decision. #SRE #SiteReliabilityEngineering #Leadership #CIO #DigitalTransformation #Resilience #ITStrategy #EnterpriseIT #TechnologyLeadership #OperationalExcellence
https://stayingalive.in/cataloguing-strategic-innov/reliability-is-a-business.html -
Reliability Is a Business Decision.
Reliability is not an engineering goal. It is a leadership decision. #SRE #SiteReliabilityEngineering #Leadership #CIO #DigitalTransformation #Resilience #ITStrategy #EnterpriseIT #TechnologyLeadership #OperationalExcellencehttps://technologytrends60.wordpress.com/2026/05/04/reliability-is-a-business-decision/
-
Netflix operates one of the most advanced multi-region active-active architectures on AWS, designed for global resilience, fault isolation, and continuous availability.
This article explores key lessons in:
• Distributed systems design
• Eventual consistency
• Region isolation
• Cloud scalability strategies#AWS #DevOps #CloudArchitecture #DistributedSystems #SiteReliabilityEngineering #Microservices #Scalability #Tech
-
Netflix operates one of the most advanced multi-region active-active architectures on AWS, designed for global resilience, fault isolation, and continuous availability.
This article explores key lessons in:
• Distributed systems design
• Eventual consistency
• Region isolation
• Cloud scalability strategies#AWS #DevOps #CloudArchitecture #DistributedSystems #SiteReliabilityEngineering #Microservices #Scalability #Tech
-
Master Chaos Engineering to build resilient distributed systems. Explore hypothesis testing, blast radius control, and tools like AWS FIS vs. LitmusChaos. https://hackernoon.com/engineering-resilience-a-deep-dive-into-chaos-engineering-in-distributed-systems #sitereliabilityengineering
-
Master Chaos Engineering to build resilient distributed systems. Explore hypothesis testing, blast radius control, and tools like AWS FIS vs. LitmusChaos. https://hackernoon.com/engineering-resilience-a-deep-dive-into-chaos-engineering-in-distributed-systems #sitereliabilityengineering
-
Master Chaos Engineering to build resilient distributed systems. Explore hypothesis testing, blast radius control, and tools like AWS FIS vs. LitmusChaos. https://hackernoon.com/engineering-resilience-a-deep-dive-into-chaos-engineering-in-distributed-systems #sitereliabilityengineering
-
Master Chaos Engineering to build resilient distributed systems. Explore hypothesis testing, blast radius control, and tools like AWS FIS vs. LitmusChaos. https://hackernoon.com/engineering-resilience-a-deep-dive-into-chaos-engineering-in-distributed-systems #sitereliabilityengineering
-
Master Chaos Engineering to build resilient distributed systems. Explore hypothesis testing, blast radius control, and tools like AWS FIS vs. LitmusChaos. https://hackernoon.com/engineering-resilience-a-deep-dive-into-chaos-engineering-in-distributed-systems #sitereliabilityengineering
-
QCon London 2026: Wrangling Telemetry at Scale, a Guide to Self-Hosted Observability
At QCon London 2026, Colin Douch discussed building and operating self-hosted monitoring stacks, surveyed the current tooling landscape,…
#NewsBeep #News #Technology #CA #Canada #Development #DevOps #DistributedTracing #logging #metrics #Observability #OpenTelemetry #Prometheus #selfhostedobservability #SiteReliabilityEngineering #Telemetry
https://www.newsbeep.com/ca/546560/ -
#TechDebt isn't something you "clean up".
It's something you inherit.Old budgets.
Old decisions.
Old survival strategies.
(Patterns go brrr.)I rewrote my tech-debt essay and published it on #Substack.
It's about why planning feels like necromancy,
why teams repeat failure modes,
and how language becomes infrastructure.If you've ever thought
"this technically works, but something’s off":
this is for you.👉 Tech Debt Isn't Bad Code—It's Encoded Legacy Patterns
📎 https://systemicengineering.substack.com/p/tech-debt-and-encoded-legacy-patterns#SRE #SiteReliabilityEngineering #HumanSystems #SystemsThinking
-
#TechDebt isn't something you "clean up".
It's something you inherit.Old budgets.
Old decisions.
Old survival strategies.
(Patterns go brrr.)I rewrote my tech-debt essay and published it on #Substack.
It's about why planning feels like necromancy,
why teams repeat failure modes,
and how language becomes infrastructure.If you've ever thought
"this technically works, but something’s off":
this is for you.👉 Tech Debt Isn't Bad Code—It's Encoded Legacy Patterns
📎 https://systemicengineering.substack.com/p/tech-debt-and-encoded-legacy-patterns#SRE #SiteReliabilityEngineering #HumanSystems #SystemsThinking
-
#TechDebt isn't something you "clean up".
It's something you inherit.Old budgets.
Old decisions.
Old survival strategies.
(Patterns go brrr.)I rewrote my tech-debt essay and published it on #Substack.
It's about why planning feels like necromancy,
why teams repeat failure modes,
and how language becomes infrastructure.If you've ever thought
"this technically works, but something’s off":
this is for you.👉 Tech Debt Isn't Bad Code—It's Encoded Legacy Patterns
📎 https://systemicengineering.substack.com/p/tech-debt-and-encoded-legacy-patterns#SRE #SiteReliabilityEngineering #HumanSystems #SystemsThinking
-
#TechDebt isn't something you "clean up".
It's something you inherit.Old budgets.
Old decisions.
Old survival strategies.
(Patterns go brrr.)I rewrote my tech-debt essay and published it on #Substack.
It's about why planning feels like necromancy,
why teams repeat failure modes,
and how language becomes infrastructure.If you've ever thought
"this technically works, but something’s off":
this is for you.👉 Tech Debt Isn't Bad Code—It's Encoded Legacy Patterns
📎 https://systemicengineering.substack.com/p/tech-debt-and-encoded-legacy-patterns#SRE #SiteReliabilityEngineering #HumanSystems #SystemsThinking
-
Reliability.
Consistent results under load.#SiteReliabilityEngineering.
..Your team is a #DistributedSystem.
Language is the transport layer.
And truth is local.
(Site.)#TechDebt slows down delivery.
Decisions are unowned.
And people burn out.
(Reliability.)Divergent realities are a primary (in)variant of human systems.
Linguistic precision counters entropy accruing ambiguity.
And coherence is regulative.
(Engineering.)..
Intrigued?
I write about language, technology and #HumanSystems.
👉 https://systemic.engineering/trauma-awareness/ -
Reliability.
Consistent results under load.#SiteReliabilityEngineering.
..Your team is a #DistributedSystem.
Language is the transport layer.
And truth is local.
(Site.)#TechDebt slows down delivery.
Decisions are unowned.
And people burn out.
(Reliability.)Divergent realities are a primary (in)variant of human systems.
Linguistic precision counters entropy accruing ambiguity.
And coherence is regulative.
(Engineering.)..
Intrigued?
I write about language, technology and #HumanSystems.
👉 https://systemic.engineering/trauma-awareness/ -
Reliability.
Consistent results under load.#SiteReliabilityEngineering.
..Your team is a #DistributedSystem.
Language is the transport layer.
And truth is local.
(Site.)#TechDebt slows down delivery.
Decisions are unowned.
And people burn out.
(Reliability.)Divergent realities are a primary (in)variant of human systems.
Linguistic precision counters entropy accruing ambiguity.
And coherence is regulative.
(Engineering.)..
Intrigued?
I write about language, technology and #HumanSystems.
👉 https://systemic.engineering/trauma-awareness/ -
Reliability.
Consistent results under load.#SiteReliabilityEngineering.
..Your team is a #DistributedSystem.
Language is the transport layer.
And truth is local.
(Site.)#TechDebt slows down delivery.
Decisions are unowned.
And people burn out.
(Reliability.)Divergent realities are a primary (in)variant of human systems.
Linguistic precision counters entropy accruing ambiguity.
And coherence is regulative.
(Engineering.)..
Intrigued?
I write about language, technology and #HumanSystems.
👉 https://systemic.engineering/trauma-awareness/ -
Agents enter the room.
Quick! What do you do?
..
(Don't look at me.)Agents are non-embodied actors in human systems.
Agents receive context as input.
Agents decide, execute, loop.
Agents reduce complexity.
Until the END.Who pays the embodied cost of AI-driven sense-making?
And why is it never the systems that scale it?I write about language, technology and human systems.
👉 https://systemic.engineering/who-invited-the-agent-oh-god-smith-will-suffice/#SystemicEngineering #SRE #SREforHumans #SiteReliabilityEngineering #Agents #AI #AIEthics #AI
-
Agents enter the room.
Quick! What do you do?
..
(Don't look at me.)Agents are non-embodied actors in human systems.
Agents receive context as input.
Agents decide, execute, loop.
Agents reduce complexity.
Until the END.Who pays the embodied cost of AI-driven sense-making?
And why is it never the systems that scale it?I write about language, technology and human systems.
👉 https://systemic.engineering/who-invited-the-agent-oh-god-smith-will-suffice/#SystemicEngineering #SRE #SREforHumans #SiteReliabilityEngineering #Agents #AI #AIEthics #AI
-
Agents enter the room.
Quick! What do you do?
..
(Don't look at me.)Agents are non-embodied actors in human systems.
Agents receive context as input.
Agents decide, execute, loop.
Agents reduce complexity.
Until the END.Who pays the embodied cost of AI-driven sense-making?
And why is it never the systems that scale it?I write about language, technology and human systems.
👉 https://systemic.engineering/who-invited-the-agent-oh-god-smith-will-suffice/#SystemicEngineering #SRE #SREforHumans #SiteReliabilityEngineering #Agents #AI #AIEthics #AI
-
Agents enter the room.
Quick! What do you do?
..
(Don't look at me.)Agents are non-embodied actors in human systems.
Agents receive context as input.
Agents decide, execute, loop.
Agents reduce complexity.
Until the END.Who pays the embodied cost of AI-driven sense-making?
And why is it never the systems that scale it?I write about language, technology and human systems.
👉 https://systemic.engineering/who-invited-the-agent-oh-god-smith-will-suffice/#SystemicEngineering #SRE #SREforHumans #SiteReliabilityEngineering #Agents #AI #AIEthics #AI
-
https://www.europesays.com/ie/162089/ You Are Asking the Wrong Questions (About Reliability and SRE) #Arts #ArtsAndDesign #ArtsAndDesign #ArtsDesign #Design #devops #Éire #Entertainment #IE #InfoQ #InfoQDevSummit #InfoQDevSummitBoston2025 #Ireland #QConSoftwareDevelopmentConference #SiteReliabilityEngineering #SreQuestions #Transcripts
-
Learn how observability before migration reduces outages, sets clear SLOs, and makes enterprise modernizations predictable and safe. https://hackernoon.com/instrument-then-migrate-observability-lessons-from-mobile-monitoring-vans-to-fortune-100-apps #sitereliabilityengineering
-
Learn how observability before migration reduces outages, sets clear SLOs, and makes enterprise modernizations predictable and safe. https://hackernoon.com/instrument-then-migrate-observability-lessons-from-mobile-monitoring-vans-to-fortune-100-apps #sitereliabilityengineering
-
Learn how observability before migration reduces outages, sets clear SLOs, and makes enterprise modernizations predictable and safe. https://hackernoon.com/instrument-then-migrate-observability-lessons-from-mobile-monitoring-vans-to-fortune-100-apps #sitereliabilityengineering
-
Learn how observability before migration reduces outages, sets clear SLOs, and makes enterprise modernizations predictable and safe. https://hackernoon.com/instrument-then-migrate-observability-lessons-from-mobile-monitoring-vans-to-fortune-100-apps #sitereliabilityengineering
-
Learn how observability before migration reduces outages, sets clear SLOs, and makes enterprise modernizations predictable and safe. https://hackernoon.com/instrument-then-migrate-observability-lessons-from-mobile-monitoring-vans-to-fortune-100-apps #sitereliabilityengineering
-
Received this copy of #OReilly #SiteReliabilityEngineering for correctly answering a question on a podcast. It is now available on https://Y2kChecklist.com and can be optionally signed by Wizards Anonymous.
-
Received this copy of #OReilly #SiteReliabilityEngineering for correctly answering a question on a podcast. It is now available on https://Y2kChecklist.com and can be optionally signed by Wizards Anonymous.
-
Received this copy of #OReilly #SiteReliabilityEngineering for correctly answering a question on a podcast. It is now available on https://Y2kChecklist.com and can be optionally signed by Wizards Anonymous.
-
Received this copy of #OReilly #SiteReliabilityEngineering for correctly answering a question on a podcast. It is now available on https://Y2kChecklist.com and can be optionally signed by Wizards Anonymous.
-
Received this copy of #OReilly #SiteReliabilityEngineering for correctly answering a question on a podcast. It is now available on https://Y2kChecklist.com and can be optionally signed by Wizards Anonymous.
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
I'm still looking. I'm still at the #VA but I've lost: 2 data engineers, 1 system engineer, 1 cybersecurity engineer, my contracting manager (last friday, no notice) and our director just announced retirement.
I've been applying everywhere for relevant roles. Even in 2021, I had a reply response of around 20:1 . Now, ive put in probably 200. I've got nothing. (well, thats not true. weapons manufacture keeps pinging. no way no how)
Im looking for #cloud #systemsengineering #sitereliabilityengineering #cybersecurity #systemarchitect
DM and I'd be happy to send my resume or apply at a role you suggest.
Aside: stuff like this is why I think I'm in this situation. Basically, fake/AI resumes are drowning the market. https://www.cnbc.com/2025/04/08/fake-job-seekers-use-ai-to-interview-for-remote-jobs-tech-ceos-say.html
I saw this on HN this morning. And it seems terrible for everyone affected, well, except for the fake/AI users with batch apply scripts. Also explains LinkedIN jobs "submitted 1h ago, over 100 applied". https://news.ycombinator.com/item?id=43631384
-
A year and change in and i made my first prod outage! Yay large DNS TTL's! It was my fault for not realizing it beforehand but .. sucks so much.
-
A year and change in and i made my first prod outage! Yay large DNS TTL's! It was my fault for not realizing it beforehand but .. sucks so much.
-
A year and change in and i made my first prod outage! Yay large DNS TTL's! It was my fault for not realizing it beforehand but .. sucks so much.
-
A year and change in and i made my first prod outage! Yay large DNS TTL's! It was my fault for not realizing it beforehand but .. sucks so much.
-
A year and change in and i made my first prod outage! Yay large DNS TTL's! It was my fault for not realizing it beforehand but .. sucks so much.