From the course: DevOps Foundations: Microservices

Site reliability engineering

Site reliability engineering is a relatively new practice in IT. It's closely related to DevOps, but with a different emphasis and more specific practices. Let's dig into the history of SRE and how DevOps and SRE work together to enable microservices. SRE was originally developed at Google to address its rapid expansion of infrastructure. It's no secret that Google has a massive and ever-growing need for compute and networking infrastructure. This led to a scaling problem. If the number of engineers needed to maintain all that infrastructure were to increase linearly with that infrastructure, it would quickly become unsustainable. Google, and any other company of similar scale, would sooner rather than later need thousands of sysadmins. Thus, SRE was developed as a way to scale human resources logarithmically with infrastructure. By measuring the right metrics, hiring engineers with the right combination of skills, and emphasizing automation, hyperscale enterprises can run at a…
