- A structured analysis conducted after a significant incident or outage to determine root causes, document lessons learned, and identify preventive measures. The process involves creating a timeline of events, analyzing contributing factors without assigning blame, and establishing action items to improve system reliability and response procedures. Post mortems are a critical component of incident management and site reliability engineering practices, helping organizations build more resilient systems through systematic learning from failures.
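The three parts named above (a timeline of events, blameless contributing factors, and follow-up action items) can be sketched as a small record type. This is only an illustrative sketch: the class and field names (`PostMortem`, `TimelineEvent`, `ActionItem`, `owner`, and so on) are hypothetical and do not come from any standard or tool.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class TimelineEvent:
    at: datetime       # when the event happened
    description: str   # what was observed or done


@dataclass
class ActionItem:
    description: str
    owner: str         # assumed to be a team or role responsible for follow-up
    done: bool = False


@dataclass
class PostMortem:
    title: str
    timeline: list[TimelineEvent] = field(default_factory=list)
    # Factors describe conditions and system behavior, not individuals.
    contributing_factors: list[str] = field(default_factory=list)
    action_items: list[ActionItem] = field(default_factory=list)

    def ordered_timeline(self) -> list[TimelineEvent]:
        """Return the events sorted chronologically."""
        return sorted(self.timeline, key=lambda event: event.at)

    def open_action_items(self) -> list[ActionItem]:
        """Return the action items that are not yet completed."""
        return [item for item in self.action_items if not item.done]
```

Keeping action items as structured entries rather than free text makes it straightforward to list what is still open after the review.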