SRE Team Management: Operational Overload

Site reliability engineers (SREs) are responsible for many administrative tasks, often splitting their time between reactive ops work and special projects. When a team is at risk of overload, SREs may be transferred to it to prevent or help mitigate the problem. In this course, you will learn how to deal with operational overload. You’ll start by examining ops mode, an approach used to ensure services are properly maintained and optimized. You’ll discover factors that contribute to team morale and stress. In addition, you will outline emergency planning strategies and best practices, and learn how to categorize emergencies and prepare detailed emergency plans. Next, you’ll explore how knowledge sharing relates to emergency preparedness, the key to writing successful postmortems, the importance of service level objectives, and how an appropriate level of detail is required to properly explain your findings. Lastly, you’ll discover the key factors and attributes of successful teams. You’ll examine a team-first approach and differentiate between questioning techniques such as open/closed, funnel, probing, and leading.