From the course: Site Reliability Engineering Essential Training

Unlock this course with a free trial

Join today to access over 24,900 courses taught by industry experts.

Implementing SLOs: Real-world examples

Implementing SLOs: Real-world examples

All right. Let's look at implementing SLOs, real-world examples. I'll walk you through a series of SLO definitions and hopefully, things will become very clear on how to set SLOs for yourselves. First, let's take a web application. The class. So I'm adding an additional field called class here. This is to categorize SLOs to help you understand SLOs a little bit better. So this SLO is about availability. The service must maintain 99.99% availability over a rolling four-week period. This is the most basic, straightforward availability SLO. 99.99% availability, and you want it over every four rolling weeks. SLI, what is the SLI that powers this SLO? It's a percentage of requests that do not result in 5XX which is basically any 500 series HTTP status code. So this is important. I am defining what the SLI is. It's basically the percentage of successful requests. And what are successful requests? That do not result in 500 HTTP status code errors. And what is the measurement? Count of…

Contents