II.
LibraryProcess overview
Reference · livelib-process:devops-sre-platform--error-budget-management
error-budget-management overview
Error Budget Management Process - Comprehensive SLO-based reliability engineering framework covering error budget calculation, burn rate monitoring, policy enforcement, stakeholder communication, incident correlation, and continuous reliability improvement through data-driven decision making.
Attributes
displayName
error-budget-management
description
Error Budget Management Process - Comprehensive SLO-based reliability engineering framework covering
error budget calculation, burn rate monitoring, policy enforcement, stakeholder communication, incident correlation,
and continuous reliability improvement through data-driven decision making.
libraryPath
library/specializations/devops-sre-platform/error-budget-management.js
specialization
devops-sre-platform
references
- - Google SRE Book - Embracing Risk: https://sre.google/sre-book/embracing-risk/ - Google SRE Workbook - Implementing SLOs: https://sre.google/workbook/implementing-slos/ - The Site Reliability Workbook - Error Budgets: https://sre.google/workbook/error-budget-policy/ - Alex Hidalgo - Implementing Service Level Objectives: https://www.alex-hidalgo.com/ - Sloth - SLO Toolkit: https://github.com/slok/sloth
example
const result = await orchestrate('specializations/devops-sre-platform/error-budget-management', {
services: ['payment-api', 'checkout-service', 'user-service'],
slos: {
availability: { target: 99.9, window: '30d' },
latency: { target: 200, percentile: 95, window: '30d' },
errorRate: { target: 0.1, window: '30d' }
},
reportingPeriod: '30d',
alertThresholds: {
burnRateFast: 14.4, // 2% budget in 1 hour
burnRateSlow: 6.0 // 5% budget in 6 hours
},
stakeholders: ['engineering', 'product', 'leadership'],
environment: 'production'
});
usesAgents
- general-purpose
Outgoing edges
lib_applies_to_domain1
- domain:devops·DomainDevOps
lib_belongs_to_specialization1
- specialization:devops-sre-platform·Specialization
lib_covers_topic1
- topic:observability-driven-development·TopicObservability-Driven Development
lib_implements_workflow1
- workflow:incident-response·Workflow
lib_involves_role1
- role:sre·Role
lib_requires_skill_area2
- skill-area:sli-slo-management·SkillAreaSLI / SLO Management
- skill-area:alerting-oncall·SkillAreaAlerting & On-Call Management
Incoming edges
None.