SRE Emergency & Incident Response: Responding to Emergencies
“Site Reliability Engineers (SREs) are responsible for assigning the appropriate resources and responsibilities to effectively deal with unexpected emergencies. To do this SREs should ensure the proper processes and teams are in place before an emergency occurs.
In this course you ll explore the different emergency types and outline how to plan for them. You ll examine the causes of and how to respond to test-induced change-induced and process-induced emergencies and what s involved in proactive approaches to emergency testing and planning.
You ll then outline the critical steps to correctly documenting emergencies including the history of outages and mistakes. You ll then differentiate between business continuity and disaster recovery planning and outline how to create both types of plans and conduct a business impact analysis. Lastly you ll explore some IT recovery strategies.”