When your application goes haywire, the most valuable engineering skill is not the the ability to bring up a copy of your system or even knowledge of a your technology stack (although it doesn’t hurt). It is the skill of understanding and solving problems.
Finding the root cause of the issue and mitigating it with minimal disruption is a must-have skill for engineers responsible for developing and managing production systems. In this talk I will discuss the skills required to troubleshoot complex systems and traits that prevent engineers from being successful at troubleshooting. We will discuss some tips and tricks for troubleshooting complex systems in production.