SkillArea overview
Reference · live · skill-area:incident-management
Incident Management overview
Coordinating the detection, response, and resolution of production incidents: severity classification, incident commander roles, escalation paths, real-time communication, and blameless postmortems. Reduces mean time to recovery and builds organizational memory that prevents repeated failure patterns in SRE and operations contexts.
Attributes
displayName
Incident Management
description
Coordinating the detection, response, and resolution of production incidents:
severity classification, incident commander roles, escalation paths, real-time
communication, and blameless postmortems. Reduces mean time to recovery and
builds organizational memory that prevents repeated failure patterns in SRE
and operations contexts.
domains
expertiseLevels
- intermediate
- expert
Outgoing edges
applies_to (2)
- domain:devops · Domain · DevOps
- specialization:sre · Specialization
prerequisite_for_learning (3)
- skill-area:runbook-authoring · SkillArea · Runbook Authoring
- skill-area:incident-response · SkillArea · Incident Response
- skill-area:runbook-writing · SkillArea · Runbook Writing
Incoming edges
lib_requires_skill_area (22)
- lib-agent:data-science-ml--incident-responder · LibraryAgent · incident-responder
- lib-agent:devops-sre-platform--incident-commander · LibraryAgent · incident-commander
- lib-agent:devops-sre-platform--sre-expert · LibraryAgent · sre-expert
- lib-agent:customer-experience--escalation-coordinator · LibraryAgent · escalation-coordinator
- lib-agent:customer-experience--itil-service-manager · LibraryAgent · itil-service-manager
- lib-agent:public-relations--crisis-communications-expert · LibraryAgent · crisis-communications-expert
- lib-agent:supply-chain--disruption-response-manager · LibraryAgent · disruption-response-manager
- lib-agent:healthcare--patient-safety-officer · LibraryAgent · patient-safety-officer
- lib-process:collaboration--pr-lifecycle-hotfix · LibraryProcess · specializations/collaboration/github/pr-lifecycle-hotfix
- lib-process:devops-sre-platform--incident-response · LibraryProcess · incident-response
- lib-process:observability--sre-aws · LibraryProcess · specializations/observability/sre/sre-aws
- lib-process:observability--sre-azure · LibraryProcess · specializations/observability/sre/sre-azure
- lib-process:observability--sre-base · LibraryProcess · specializations/observability/sre/sre-base
- lib-process:observability--sre-gcp · LibraryProcess · specializations/observability/sre/sre-gcp
- lib-process:security-compliance--incident-response · LibraryProcess · incident-response
- lib-skill:devops-sre-platform--incident-platforms · LibrarySkill · incident-platforms
- lib-skill:customer-experience--escalation-workflow · LibrarySkill · escalation-workflow
- lib-skill:project-management--issue-tracker · LibrarySkill · issue-tracker
- lib-skill:public-relations--crisis-management-platform · LibrarySkill · crisis-management-platform
- lib-skill:supply-chain--disruption-response-coordinator · LibrarySkill · disruption-response-coordinator
- lib-skill:arts-culture--risk-mitigation-planning · LibrarySkill · risk-mitigation-planning
- lib-skill:healthcare--patient-safety-event-analysis · LibrarySkill · patient-safety-event-analysis
prerequisite_for_learning (5)
- skill-area:runbook-automation · SkillArea · Runbook Automation
- skill-area:on-call-optimization · SkillArea · On-Call Optimization
- skill-area:post-incident-review · SkillArea · Post-Incident Review
- skill-area:change-management-ops · SkillArea · Change Management (Operations)
- skill-area:alerting-oncall · SkillArea · Alerting & On-Call Management
requires_expertise (2)
- responsibility:on-call-rotation-fairness · Responsibility · On-call rotation fairness
- responsibility:incident-response-coordination · Responsibility · Incident response coordination
requires_skill_area (2)
- stack-profile:incident-management-platform · StackProfile · Incident Management (Go, PostgreSQL, Redis, PagerDuty, Slack, Prometheus)
- workflow:on-call-handoff · Workflow · On-Call Handoff
tool_used_by (1)
- tool:pagerduty · Tool · PagerDuty
used_for (1)
- tool:pagerduty · Tool · PagerDuty