Building a Successful Site Reliability Engineering Organization: Like Google, Amazon, and Netflix
Site reliability engineering (SRE) is a new discipline that focuses on the high availability and reliability of production systems that are mission- and revenue-critical for an organization. This article is meant for developers, DevOps engineers, and engineering leaders who want to build highly available and reliable systems for their customers. In...