II.
Methodology overview
Reference · live · methodology:Google-SRE-methodology
Google SRE Methodology overview
Site Reliability Engineering (SRE) as practiced at Google treats operations as a software engineering problem. It defines service reliability through service level indicators (SLIs), service level objectives (SLOs), and error budgets. When a service's error budget is exhausted, feature releases slow so that engineering effort shifts to reliability. Core practices include toil budgeting (operational work kept under 50% of an engineer's time), blameless postmortems, progressive rollouts, and on-call engineering rotations.
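The relationship between an SLO and its error budget can be illustrated with a small calculation. The sketch below is a minimal illustration, not Google's tooling: the `SloWindow` class, the `release_policy` function, and the request-count inputs are all hypothetical names chosen for this example, and it assumes a simple request-based availability SLI.

```python
from dataclasses import dataclass


@dataclass
class SloWindow:
    """Request counts observed over one SLO window (hypothetical structure)."""
    slo_target: float      # e.g. 0.999 for a 99.9% availability SLO
    total_requests: int
    failed_requests: int

    def sli(self) -> float:
        """Measured availability: fraction of requests that succeeded."""
        if self.total_requests == 0:
            return 1.0
        return 1 - self.failed_requests / self.total_requests

    def allowed_failures(self) -> float:
        """Error budget for the window, expressed as a number of requests."""
        return (1 - self.slo_target) * self.total_requests

    def budget_remaining(self) -> float:
        """Fraction of the error budget still unspent (negative if overspent)."""
        allowed = self.allowed_failures()
        if allowed == 0:
            # No failures are permitted; any failure exhausts the budget.
            return 0.0 if self.failed_requests else 1.0
        return 1 - self.failed_requests / allowed


def release_policy(window: SloWindow) -> str:
    """Illustrative policy: shift focus to reliability once the budget is spent."""
    return "normal" if window.budget_remaining() > 0 else "reliability-focus"


if __name__ == "__main__":
    healthy = SloWindow(slo_target=0.999, total_requests=1_000_000, failed_requests=250)
    overspent = SloWindow(slo_target=0.999, total_requests=1_000_000, failed_requests=1_200)
    print(release_policy(healthy))    # budget partly unspent
    print(release_policy(overspent))  # budget exhausted
```

With a 99.9% target over one million requests, roughly 1,000 failures are permitted in the window. The policy function is deliberately binary; real error budget policies are usually agreed between product and SRE teams and may include intermediate steps.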
Attributes
displayName
Google SRE Methodology
description
Site Reliability Engineering (SRE) as practiced at Google treats operations as
a software engineering problem. It defines service reliability through service
level indicators (SLIs), service level objectives (SLOs), and error budgets.
When a service's error budget is exhausted, feature releases slow so that
engineering effort shifts to reliability. Core practices include toil budgeting
(operational work kept under 50% of an engineer's time), blameless postmortems,
progressive rollouts, and on-call engineering rotations.
methodologyKind
operations
origin
Google (Ben Treynor Sloss)
yearIntroduced
2003
Outgoing edges
applies_to (3)
- domain:site-reliability · Domain · Site Reliability Engineering
- domain:devops · Domain · DevOps
- platform:krate · Platform · Krate
Incoming edges
None.