#incidentmanagement — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #incidentmanagement, aggregated by home.social.
-
Mean time to repair directly impacts revenue and trust. When automation cuts MTTR by over 50%, the business case becomes clear: fewer escalations, less downtime, and calmer teams.
-
The 2024 CrowdStrike outage caused a worldwide Windows Blue Screen crash, impacting airlines, banks, and enterprises.
This deep dive explains how DevOps & SRE teams mitigated impact, recovered systems, and prevented total failure.
🔗 https://shorturl.at/VLqxz#CrowdStrikeOutage #DevOps #SRE #IncidentManagement #CyberResilience #CloudOps #PostMortem #ReliabilityEngineering #aws
-
The 2024 CrowdStrike outage caused a worldwide Windows Blue Screen crash, impacting airlines, banks, and enterprises.
This deep dive explains how DevOps & SRE teams mitigated impact, recovered systems, and prevented total failure.
🔗 https://shorturl.at/VLqxz#CrowdStrikeOutage #DevOps #SRE #IncidentManagement #CyberResilience #CloudOps #PostMortem #ReliabilityEngineering #aws
-
#Development #Findings
The Pragmatic Engineer 2025 Survey (Part 3) · Which tools do software engineers use today? https://ilo.im/167n2s_____
#Observability #IncidentManagement #Experimentation #TechStack #Tooling #Frameworks #DevOps #WebDev #Frontend -
Auch 2026 findet wieder ein #GI-SPRING-Graduiertenworkshop der Fachgruppe Security - Intrusion Detection and Response (SIDAR) statt. Diesmal am 21. und 22.04.2026 in #Heidelberg.
Zu den Themen gehören #VulnerabilityAssessment, #ThreatIntelligence, #IntrusionDetection, #Malware, #IncidentManagement, #WirelessSecurity, #DigitalForensics usw.
Einreichungen werden bis zum 15.03.2026 angenommen.
-
Auch 2026 findet wieder ein #GI-SPRING-Graduiertenworkshop der Fachgruppe Security - Intrusion Detection and Response (SIDAR) statt. Diesmal am 21. und 22.04.2026 in #Heidelberg.
Zu den Themen gehören #VulnerabilityAssessment, #ThreatIntelligence, #IntrusionDetection, #Malware, #IncidentManagement, #WirelessSecurity, #DigitalForensics usw.
Einreichungen werden bis zum 15.03.2026 angenommen.
-
Auch 2026 findet wieder ein #GI-SPRING-Graduiertenworkshop der Fachgruppe Security - Intrusion Detection and Response (SIDAR) statt. Diesmal am 21. und 22.04.2026 in #Heidelberg.
Zu den Themen gehören #VulnerabilityAssessment, #ThreatIntelligence, #IntrusionDetection, #Malware, #IncidentManagement, #WirelessSecurity, #DigitalForensics usw.
Einreichungen werden bis zum 15.03.2026 angenommen.
-
Auch 2026 findet wieder ein #GI-SPRING-Graduiertenworkshop der Fachgruppe Security - Intrusion Detection and Response (SIDAR) statt. Diesmal am 21. und 22.04.2026 in #Heidelberg.
Zu den Themen gehören #VulnerabilityAssessment, #ThreatIntelligence, #IntrusionDetection, #Malware, #IncidentManagement, #WirelessSecurity, #DigitalForensics usw.
Einreichungen werden bis zum 15.03.2026 angenommen.
-
Today's AWS outage was a stark reminder: what happens when the tools you rely on to manage incidents... are part of the incident?
When Slack, Zoom, PagerDuty, and even Statuspage are impacted, how do you get your response team re-connected to solve the underlying problem? Once they're talking to each other, they can improvise a response, but that first step of re-establishing contact is critical.
This isn't just a hypothetical. It's a real-world scenario that can paralyze even the most prepared organizations. Relying on a plan that's tucked away in a long-forgotten document is a recipe for disaster.
Here's what I recommend to the leaders I advise:
🔹 Have a "Rally Point" Plan: Don't just have a backup concept; have a pre-defined, communicated, and accessible fallback plan. Every second counts in an incident, and you can't waste time figuring out where to communicate. If you normally use Slack and Zoom, then think Google Meet or Microsoft Teams for your backup, and vice versa. Maybe even an old-fashioned conference call bridge. The key is that everyone knows where to go, when the normal places aren't working.
🔹 Make it Accessible: Your plan is useless if it's on a server that nobody can get to at the moment. Laminated wallet cards, a shared password vault with offline access, or a regularly updated file on every employee's laptop are all viable options.
🔹 Practice, Practice, Practice: Fire drills aren't just for fires. Run drills for your fallback communication plan. This ensures everyone remembers it exists and that the mechanisms still work.
🔹 Don't Forget Security: Assume that your fallback channel is compromised, and that outsiders are listening in. Use it just as a rendezvous point to direct responders to more secure, authenticated channels, where you can validate every participant. Don't discuss sensitive information in the open.
Incidents are costly, not just in revenue, but in reputation and team morale. Proactive preparation isn't a luxury; it's a necessity.
What's your team's communication fallback plan? Share your thoughts in the comments below. 👇
#IncidentManagement #BusinessContinuity #SiteReliability #DevOps #AWSOutage
-
Agile ITSM turns rigid processes into rapid value—what’s your next move? #AgileITSM #DigitalTransformation #ITLeadership #ModernIT #DevOps #ITOps #AgileMindset #ServiceExcellence #IncidentManagement #ContinuousImprovement #Automation #SelfService #Collaboration #Swarming #MTTR #MTTD #Metrics #Innovation #CustomerSatisfaction
https://medium.com/@sanjay.mohindroo66/beyond-the-ticket-agile-itsm-for-speed-clarity-and-impact-550a98882cb1 -
Agile ITSM turns rigid processes into rapid value—what’s your next move? #AgileITSM #DigitalTransformation #ITLeadership #ModernIT #DevOps #ITOps #AgileMindset #ServiceExcellence #IncidentManagement #ContinuousImprovement #Automation #SelfService #Collaboration #Swarming #MTTR #MTTD #Metrics #Innovation #CustomerSatisfaction
https://medium.com/@sanjay.mohindroo66/beyond-the-ticket-agile-itsm-for-speed-clarity-and-impact-550a98882cb1 -
Agile ITSM turns rigid processes into rapid value—what’s your next move? #AgileITSM #DigitalTransformation #ITLeadership #ModernIT #DevOps #ITOps #AgileMindset #ServiceExcellence #IncidentManagement #ContinuousImprovement #Automation #SelfService #Collaboration #Swarming #MTTR #MTTD #Metrics #Innovation #CustomerSatisfaction
https://medium.com/@sanjay.mohindroo66/beyond-the-ticket-agile-itsm-for-speed-clarity-and-impact-550a98882cb1 -
Agile ITSM turns rigid processes into rapid value—what’s your next move? #AgileITSM #DigitalTransformation #ITLeadership #ModernIT #DevOps #ITOps #AgileMindset #ServiceExcellence #IncidentManagement #ContinuousImprovement #Automation #SelfService #Collaboration #Swarming #MTTR #MTTD #Metrics #Innovation #CustomerSatisfaction
https://medium.com/@sanjay.mohindroo66/beyond-the-ticket-agile-itsm-for-speed-clarity-and-impact-550a98882cb1 -
Release It! by Michael T. Nygard
"Manage perceptions after a major #incident It’s as important as managing the incident itself."
-
In August 2020, @SchizoDuckie and I published what was to become the first of a series of articles or posts called "No Need to Hack When It's Leaking."
In today's installment, I bring you "No Need to Hack When It's Leaking: Brandt Kettwick Defense Edition." It chronicles efforts by @JayeLTee, @masek, and I to alert a Minnesota law firm to lock down their exposed files, some of which were quite sensitive.
Read the post and see how even the state's Bureau of Criminal Apprehension had trouble getting this law firm to respond appropriately.
Great thanks to the Minnesota Bureau of Criminal Apprehension for their help on this one, and to @TonyYarusso and @bkoehn for their efforts.
#dataleak #misconfiguration #incidentresponse #incidentmanagement #responsibledisclosure #securityalert #infosec
-
🚨 Cyber threats are evolving fast! 74% of CISOs are increasing their crisis simulation budgets in 2025 to stay ahead. With high-profile breaches on the rise, organizations must test and refine their response strategies.
At RELIANOID, we provide the tools to enhance cyber resilience and ensure businesses are always prepared. 🛡️
#CyberSecurity #CrisisResponse #IncidentManagement #CISO #RELIANOID
https://www.relianoid.com/blog/cisos-are-increasing-crisis-simulation-budgets/ -
Mastering #TelemetryPipelines ensures high #ApplicationPerformance, cost efficiency, and security compliance. Implement best practices and stay ahead in #Observability & #Monitoring. #CloudComputing #DevOps #AI #Cybersecurity #ITGovernance #DigitalTransformation #DataAnalytics #Logging #IncidentManagement
https://medium.com/@sanjay.mohindroo66/how-to-use-telemetry-pipelines-to-maintain-application-performance-9d0972585d81 -
At RELIANOID, we help teams move from:
🚨 Chaos (fragmented tools & manual processes)
➡️ Proactive resilience (collaborative, data-driven systems).Break the "doom loop" of incident management. Let's build a culture where incidents = opportunities. 💡
#IncidentManagement #ITResilience #RELIANOID
https://www.relianoid.com/blog/transforming-incident-management-with-relianoids-support-services/ -
Mastering #TelemetryPipelines ensures high #ApplicationPerformance, cost efficiency, and security compliance. Implement best practices and stay ahead in #ITGovernance #DigitalTransformation #DataAnalytics #Logging #IncidentManagement
https://medium.com/@sanjay.mohindroo66/how-to-use-telemetry-pipelines-to-maintain-application-performance-9d0972585d81 -
From the Better-Late-Than-Never Department:
"Washington County is preparing to implement a new policy on how to respond to future cybersecurity attacks after a ransomware strike crippled the county government for more than two weeks earlier this year.
County solicitor Gary Sweat is asking the commissioners to consider approving a “business continuity and disaster contingency” plan that would have a protocol for county workers and its IT department to follow in the event of another cyber emergency."
As a reminder, they paid $350k ransom to ransomware gang to get decryptor key.
#databreach #ransomware #govsec #riskassessment #disasterplan #IncidentManagement #cybersecurity
-
"@PagerDuty Expands #GenerativeAI Solutions with PagerDuty Advance to Mitigate #Risk of Operational #Outages"
Love this! In #Observability & #IncidentManagement #AI should do more for #CloudOps #SRE #DevOps #ITOps than read docs. It should do the work!
-
"@PagerDuty Expands #GenerativeAI Solutions with PagerDuty Advance to Mitigate #Risk of Operational #Outages"
Love this! In #Observability & #IncidentManagement #AI should do more for #CloudOps #SRE #DevOps #ITOps than read docs. It should do the work!!
-
Change Healthcare submitted a breach notification to #HHS on July 19. They report the number of patients affected as "500" (a marker for "We have no friggin' idea how many and we'll get back to you at some date before the end of civilization maybe.").
They didn't comply with the "no later than 60 calendar days" requirement and I'm not sure what good a "500" report does anyone.
#databreach #HIPAA #HITECH #HealthSec #ALPHV #ransomware #cybersecurity #incidentmanagement
-
Julia Thoreson at Bloomberg sharing “Incident Management: Lessons from Emergency Services” breaking down how the lessons learned in emergency services can apply to incident management in technical systems #monitorama #monitorama24 #incidentmanagement
-
If you've ever been on-call, you know that it can be stressful AF! Next week's guest on @geekingout_pod, Ashley Sawatsky of Rootly, talks about the importance of on-call health, and what you can do to prevent trauma and burnout. Episode drops on Feb 20th.
Catch the YouTube premiere 👉 https://buff.ly/3SXSyah, or subscribe through your fave podcasting app!
#oncall #siteReliabilityEngineering #incidentResponse #incidentManagement
-
CW: Incident management / birdsite
This thread and the nested thread could be the basis of a lot of good writing about incident management.
It would be super wrong to think the question is, "How many engineers do they need to maintain Twitter?" A better question might be, "How clear an understanding do they have of how to maintain Twitter, now that so many people have left?" #IncidentManagement #Clausewitz