Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Honestly, incident management is hard. It is a different skill set to straight up SRE management, and all the training in the world won't get you ready. Believe me it's true in AWS too, true in a lot of places.

It's complicated, and the human factor is massive in an unpredicted scenario.

I'm an IM myself, and regularly have to make the call regarding status updates, and also seen how other companies do it.

One of my favourites was a big it infra company sending a notification about an outage 6 hours after the event, after we opened and resolved a ticket with them. Was actually pretty useful but made me laugh all the same



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: