II.
Role overview
Reference · liverole:incident-commander
Incident Commander overview
Takes command during active high-severity incidents, coordinating all responders, driving toward resolution, and serving as the single decision-making authority to avoid chaos during outages. Manages stakeholder communications, escalations, and external notifications throughout the incident lifecycle. Leads post-incident reviews and ensures action items are owned and tracked to completion.
Attributes
displayName: Incident Commander
isAgentic: false
automatability: 0.1
description: Takes command during active high-severity incidents, coordinating all responders, driving toward resolution, and serving as the single decision-making authority to avoid chaos during outages. Manages stakeholder communications, escalations, and external notifications throughout the incident lifecycle. Leads post-incident reviews and ensures action items are owned and tracked to completion.
seniority: senior
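For readers who consume this record programmatically, the attributes above can be sketched as a typed record. This is a minimal illustration, not the catalog's actual schema: the class name `RoleAttributes`, the snake_case field names, and the assumption that `automatability` is a score between 0 and 1 are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RoleAttributes:
    """Hypothetical container mirroring the attribute list above."""

    display_name: str
    is_agentic: bool
    automatability: float  # assumed (not confirmed by the source) to lie in [0, 1]
    seniority: str
    description: str = ""

    def __post_init__(self) -> None:
        # Reject scores outside the assumed range.
        if not 0.0 <= self.automatability <= 1.0:
            raise ValueError("automatability must be between 0 and 1")


# Values copied from the attribute list; the description is omitted for brevity.
incident_commander = RoleAttributes(
    display_name="Incident Commander",
    is_agentic=False,
    automatability=0.1,
    seniority="senior",
)
```

Freezing the dataclass keeps the record read-only, which suits reference data that is looked up rather than edited in place.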
Outgoing edges
applies_to (1)
- domain:software-engineering · Domain · Software Engineering
holds_responsibility (5)
- responsibility:incident-response · Responsibility
- responsibility:stakeholder-communication · Responsibility · Stakeholder Communication
- responsibility:on-call · Responsibility · On-Call
- responsibility:sla-management · Responsibility · SLA Management
- responsibility:disaster-recovery · Responsibility
requires_expertise (4)
- skill-area:incident-response · SkillArea · Incident Response
- skill-area:runbook-writing · SkillArea · Runbook Writing
- skill-area:sli-slo-management · SkillArea · SLI / SLO Management
- skill-area:observability-pipeline · SkillArea · Observability Pipeline
requires_skill (3)
- specialization:sre · Specialization
- domain:observability · Domain · Observability
- domain:cloud-infra · Domain · Cloud Infrastructure
Incoming edges
escalates_to (1)
- human-checkpoint:dangerous-tool-approval · HumanCheckpoint · Dangerous-tool approval
has_member (1)
- org-unit:incident-response-team · OrgUnit · Incident Response Team
held_by (4)
- responsibility:respond-incidents · Responsibility · Respond to production incidents
- responsibility:incident-response-coordination · Responsibility · Incident response coordination
- responsibility:postmortem-writeup · Responsibility · Postmortem writeup
- responsibility:incident-command · Responsibility · Incident command
involves_role (32)
- workflow:incident-response · Workflow
- workflow:post-mortem-review · Workflow · Post-Mortem Review
- workflow:hotfix-deployment · Workflow
- workflow:rollback-procedure · Workflow · Rollback Procedure
- workflow:flight-operations-review · Workflow · Flight Operations Review
- workflow:business-continuity-drill · Workflow · Business Continuity Drill
- workflow:full-stack-system-reliability-review · Workflow · Full-Stack System Reliability Review
- workflow:post-incident-review · Workflow · Post-Incident Review
- workflow:customer-escalation-management · Workflow · Customer Escalation Management
- workflow:backup-recovery-drill · Workflow · Backup Recovery Drill
- workflow:runbook-review-cycle · Workflow · Runbook Review Cycle
- workflow:disaster-recovery-failover-drill · Workflow · Disaster Recovery Failover Drill
- workflow:grid-stability-monitoring · Workflow · Grid Stability Monitoring
- workflow:change-advisory-board-review · Workflow · Change Advisory Board Review
- workflow:security-incident-response · Workflow · Security Incident Response
- workflow:rollback-execution · Workflow · Rollback Execution
- workflow:post-mortem · Workflow · Post-Mortem Review
- workflow:patient-safety-event-review · Workflow · Patient Safety Event Review
- workflow:chaos-game-day · Workflow · Chaos Game Day
- workflow:mine-safety-audit · Workflow · Mine Safety Audit
- workflow:error-budget-exhaustion-review · Workflow · Error Budget Exhaustion Review
- workflow:incident-response · Workflow
- workflow:pharmacovigilance-signal-detection · Workflow · Pharmacovigilance Signal Detection
- workflow:crisis-communication-drill · Workflow · Crisis Communication Drill
- workflow:cve-response-coordination · Workflow · CVE Response Coordination
- workflow:red-team-exercise · Workflow · Red Team Exercise
- workflow:event-operations-planning · Workflow · Event Operations Planning
- workflow:sla-breach-response · Workflow · SLA Breach Response
- workflow:incident-customer-communication · Workflow · Incident Customer Communication
- workflow:mobile-workforce-safety-check · Workflow · Mobile Workforce Safety Check
- workflow:backup-recovery-drill · Workflow · Backup Recovery Drill
- workflow:incident-response · Workflow
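The involves_role list above repeats some target ids: workflow:incident-response appears three times and workflow:backup-recovery-drill twice. Whether these are distinct edges in the underlying graph or plain duplication is not stated here. A consumer that needs distinct workflows could handle the repeats along the following lines; this is only a sketch, the helper names `repeated_targets` and `unique_targets` are hypothetical, and the sample list is a short excerpt rather than the full set of 32 entries.

```python
from collections import Counter

# Short excerpt of the involves_role target ids, in listed order.
involves_role = [
    "workflow:incident-response",
    "workflow:post-mortem-review",
    "workflow:backup-recovery-drill",
    "workflow:incident-response",
    "workflow:backup-recovery-drill",
    "workflow:incident-response",
]


def repeated_targets(targets):
    """Return {target_id: occurrences} for ids listed more than once."""
    counts = Counter(targets)
    return {target: n for target, n in counts.items() if n > 1}


def unique_targets(targets):
    """Drop repeated ids while keeping first-seen order."""
    return list(dict.fromkeys(targets))


repeats = repeated_targets(involves_role)
distinct = unique_targets(involves_role)
```

Keeping the raw list alongside the de-duplicated view avoids silently discarding edges in case the repeats turn out to be meaningful.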
lib_involves_role (7)
- lib-agent:cryptography-blockchain--incident-response · LibraryAgent · incident-response
- lib-agent:devops-sre-platform--incident-commander · LibraryAgent · incident-commander
- lib-agent:customer-experience--escalation-coordinator · LibraryAgent · escalation-coordinator
- lib-agent:security-compliance--forensic-analysis-agent · LibraryAgent · forensic-analysis-agent
- lib-agent:security-compliance--incident-triage-agent · LibraryAgent · incident-triage-agent
- lib-process:devops-sre-platform--incident-response · LibraryProcess · incident-response
- lib-skill:customer-experience--escalation-workflow · LibrarySkill · escalation-workflow
supports_work (3)
- tool-server:mcp-sentry · ToolServer · MCP Sentry
- tool-server:mcp-datadog · ToolServer · Datadog MCP Server
- tool-server:mcp-grafana · ToolServer · Grafana MCP Server
used_by_role (1)
- stack-profile:incident-management-platform · StackProfile · Incident Management (Go, PostgreSQL, Redis, PagerDuty, Slack, Prometheus)