Slide 32
IRF 2.0: Roles, Playbook
• On-Call Engineer
  • The engineer on call. Triages alerts and escalates to the IC if necessary, initiating the IRF (declaring the incident).
• Incident Commander (IC)
  • Leads the incident response. Brings in the necessary people and organizes information. May also act as the CL (Communication Lead).
  • Usually a Tech Lead or Engineering Manager.
  • Their responsibility is not to fix the issue directly, but to organize and make decisions.
• Responder
  • Handles the hands-on work, such as rollbacks and config changes.
• Communication Lead (CL)
  • Handles communication with external stakeholders (i.e., non-engineers).
Key point: Keep the IC and Responder responsibilities separate.
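The role split above could be sketched in code roughly as follows. This is a minimal illustration, not part of the IRF itself: the `Incident` class, its field names, and the example people are all hypothetical, and it assumes that "separate responsibilities" means the same person is never both IC and Responder on one incident.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Incident:
    """Hypothetical record of who holds which role in one incident."""
    title: str
    on_call: str                      # triages the alert and declares the incident
    ic: Optional[str] = None          # Incident Commander: organizes and decides
    cl: Optional[str] = None          # Communication Lead: talks to non-engineers
    responders: List[str] = field(default_factory=list)  # do the hands-on work

    def escalate(self, ic: str, cl: Optional[str] = None) -> None:
        """Hand the incident to an IC; the IC covers the CL role if none is named."""
        if ic in self.responders:
            raise ValueError("a Responder must not also be the IC")
        self.ic = ic
        self.cl = cl if cl is not None else ic

    def add_responder(self, name: str) -> None:
        """Reject assignments that would merge the IC and Responder roles."""
        if self.ic is None:
            raise ValueError("escalate to an IC before assigning responders")
        if name == self.ic:
            raise ValueError("the IC must not also be a Responder")
        self.responders.append(name)


# Example flow: on-call engineer declares, IC takes over, a Responder does the fix.
incident = Incident(title="checkout latency", on_call="alice")
incident.escalate(ic="bob")          # bob is IC and, by default, also CL
incident.add_responder("carol")      # carol handles rollbacks, config changes
```

Allowing the IC to double as CL while rejecting an IC who is also a Responder mirrors the slide: the IC may act as the CL, but the IC's job is to organize and decide rather than to fix.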
2-3. P2: Root Fixes / Rolling Out a Unified Incident Response Process