Hrefna (DHC) · @hrefna
1180 followers · 4517 posts · Server hachyderm.io

I can frame this as a blameless postmortem, but a "blameless postmortem" should never, ever mean "the people with power get a pass."

Blameless postmortems are a way of getting to the root cause. They are a way of saying "if you push a button and it brings down production, the problem in most cases is that there was a button that can bring down production, not that you pushed it."

Your conclusion can still be "the leader made the call to launch because they wanted to give good news to Reagan."

#postmortems

Last updated 2 years ago

synlogic · @synlogic
87 followers · 1531 posts · Server toot.io

heard that Twitter is DDoSing itself (?)

This is a good opportunity to announce I specialize in software perf & scalability. Reducing hosting costs. And parachuting in to solve hard bugs or otherwise "rescue" sites or projects farked up by a prior approach

as a paid consultant

#ddos #twitter #performance #scalability #scaling #tuning #costreduction #resourceminimization #troubleshooting #rescues #rewrites #systems #rootcauseanalysis #regressions #postmortems #architecture #efficiency #sre

Last updated 2 years ago

dan slimmon · @danslimmon
232 followers · 153 posts · Server hachyderm.io

"Eventually this customer has had enough. They leave. This represents both a sizable blow to revenue and a scathing indictment of your product’s reliability at scale. But, on the bright side, both MTTR and MTBF benefit enormously! That’ll look great on the quarterly slide deck." (~700w)

blog.danslimmon.com/2023/04/04

#sre #devops #incidentresponse #postmortems

Last updated 2 years ago

🌳ybaumy 🌳🐈‍⬛ · @ybaumy
134 followers · 1047 posts · Server digitalcourage.social

@jorge I love postmortems .. good read and post. Thanks

#postmortems

Last updated 2 years ago

Touraine Tech · @tourainetech
99 followers · 267 posts · Server piaille.fr

RT @dadideo
Another really interesting talk from @tourainetech: @QuesnelLise's explanations of postmortems, an important part of feedback, as well as the famous Ops/Monitor stage of the devops loop
youtu.be/zBjBq6uxp3M

#tnt23 #postmortems #Ops #feedback #devops

Last updated 2 years ago

Simon :verified: · @simon
249 followers · 199 posts · Server indiehackers.social

We do that anyway after incidents with postmortems, but it's a good time to reflect on the procedures that we typically follow.

Can absolutely recommend this practice, it also is a great time for the team to share past stories with each other...
[4/6]

#incident #postmortems

Last updated 3 years ago

Ben Cordero · @bencord0
186 followers · 45 posts · Server nfra.club

There's still value in low-technical postmortems.

What made this incident low impact? Has your team implemented various safety nets to reduce harmful effects?

How did you know that a rollback was the right thing to do?
Could you have implemented a fix-forward instead?

Who else did you need to involve? Or were you able to fully execute the incident and any runbooks by yourself without disrupting anyone else?

#incidents #incidentresponse #incidentmanagement #postmortems #icm #irm

Last updated 3 years ago

ConstantOrbit · @constantorbit
376 followers · 398 posts · Server hachyderm.io

@nova @hazelweakly As a seasoned developer/etc who's also had to do devops work, I deeply appreciate your postmortems. I love the transparency with the community.

And SO well done! And I'm actually going to borrow some of the sections for our company's. 💯 ❤️

#postmortems #devops #hachyderm

Last updated 3 years ago

Rich Lafferty 🐀 · @mendel
450 followers · 754 posts · Server hachyderm.io

pleased with this slide of mine from our monthly major incident meta-review, encouraging us towards learning from incidents and away from focusing on incident statistics

the first half says: "The insights generated from reviewing incidents are primarily qualitative, because incidents are emergent behavior"

the second half says "There is no relationship between the impact of an incident and the quality of insights generated through the review process"

#LearningFromIncidents #postmortems #sre #incidentresponse

Last updated 3 years ago

Jeffrey Yasskin · @jyasskin
188 followers · 91 posts · Server hachyderm.io

Just ELI5'ed postmortems to my 6yo.

#eli5 #postmortems

Last updated 3 years ago

Ed S · @EdS
481 followers · 3264 posts · Server mastodon.sdf.org

2011, Los Alamos, at a for-profit nuclear lab:
"Technicians settled on what seemed like a surefire way to win praise from their bosses: In a hi-tech testing and manufacturing building pivotal to sustaining America's nuclear arsenal, they gathered eight rods painstakingly crafted out of plutonium, and positioned them side-by-side on a table to photograph how nice they looked."

science.org/content/article/ne

via news.ycombinator.com/item?id=3


#postmortem #postmortems #nuclearsafety

Last updated 3 years ago

Ed S · @EdS
481 followers · 3264 posts · Server mastodon.sdf.org

Postmortems of failures in our technology, a whole forum to enjoy, recently updated with today's Cloudflare outage and a collection of other BGP mishaps:

postmortems.info/

(boosts welcome!)

#postmortems #postmortem #bgp #cloudflare

Last updated 3 years ago

MXroute · @mxroute
124 followers · 204 posts · Server freesocial.co

Today we experienced an issue with outbound mail for roughly an hour. The details are too extensive to go into here, but have been posted in our channel at chat.mxroute.com.

#postmortems

Last updated 4 years ago

John Goerzen · @jgoerzen
715 followers · 2228 posts · Server floss.social

Jonathan Hall of the Tiny DevOps Guy podcast interviewed me for episode 2! youtube.com/watch?v=-i-zRZ8nRa I discussed how aviation can give us lessons for tech. Some of the topics included postmortems, human factors impacting performance, accident chains (in both aviation and IT), safe attitudes, etc.

#postmortems #aviation

Last updated 4 years ago

MXroute · @mxroute
124 followers · 204 posts · Server freesocial.co

Postmortem for Banshee outage this morning available in channel at chat.mxroute.com (uses portal.mxroute.com login).

#postmortems

Last updated 5 years ago

MXroute · @mxroute
124 followers · 204 posts · Server freesocial.co

Arrow, Lucy, Safari, and Friday are back online. Postmortem in the channel at chat.mxroute.com.

#postmortems

Last updated 5 years ago
