A newly-formed site reliability engineering (SRE) team is focused on increasing the resilience of IT systems. The team is investigating tooling options that can be used to diagnose issues and automate operational responses.
Which practice would BEST help this team?
Submit