II. Workflow overview
Reference · live · workflow:disaster-recovery-failover-drill
Disaster Recovery Failover Drill overview
Executes end-to-end disaster recovery failover tests: triggers regional failover for critical services, validates data replication lag and consistency, measures recovery time objective (RTO) and recovery point objective (RPO) against defined targets, tests DNS failover propagation, verifies backup restoration procedures, evaluates communication plan execution, documents deviations from expected behavior, and tracks remediation items for gap closure. Excludes DR architecture design and backup strategy definition.
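The RTO and RPO measurement step can be illustrated with a minimal sketch. It assumes the drill records three timestamps: when failover was triggered, when service was restored in the secondary region, and the newest write that reached the secondary. The names `DrillTimeline`, `DrillResult`, and `evaluate_drill`, and the target values in the example, are hypothetical and not part of this workflow definition.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class DrillTimeline:
    """Timestamps captured during one drill (hypothetical field names)."""
    failover_triggered: datetime     # regional failover was initiated
    service_restored: datetime       # secondary region began serving traffic
    last_replicated_write: datetime  # newest write present in the secondary


@dataclass(frozen=True)
class DrillResult:
    rto: timedelta   # measured recovery time
    rpo: timedelta   # measured data-loss window
    rto_met: bool
    rpo_met: bool


def evaluate_drill(timeline: DrillTimeline,
                   rto_target: timedelta,
                   rpo_target: timedelta) -> DrillResult:
    """Compare measured recovery time and data-loss window with targets."""
    # Recovery time: from failover trigger until service is restored.
    rto = timeline.service_restored - timeline.failover_triggered
    # Data-loss window: writes made after the last replicated write and
    # before the trigger are assumed lost. Clamp at zero in case the
    # replica kept receiving writes after the trigger.
    lag = timeline.failover_triggered - timeline.last_replicated_write
    rpo = max(lag, timedelta(0))
    return DrillResult(rto, rpo, rto <= rto_target, rpo <= rpo_target)


# Example with made-up timestamps and targets.
timeline = DrillTimeline(
    failover_triggered=datetime(2024, 1, 1, 10, 0, 0),
    service_restored=datetime(2024, 1, 1, 10, 12, 0),
    last_replicated_write=datetime(2024, 1, 1, 9, 59, 30),
)
result = evaluate_drill(
    timeline,
    rto_target=timedelta(minutes=15),
    rpo_target=timedelta(seconds=10),
)
print(result)
```

In this example the measured recovery time is within its target while the data-loss window is not, which is the kind of deviation the workflow would document and track as a remediation item.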
Attributes
displayName: Disaster Recovery Failover Drill
workflowKind: operational
triggerType: scheduled
typicalCadence: semi-annual
complexity: cross-team
Outgoing edges
applies_to_domain (2)
- domain:infrastructure · Domain · Infrastructure
- domain:cloud-infra · Domain · Cloud Infrastructure
involves_role (4)
- role:platform-engineer · Role
- role:cloud-architect · Role
- role:incident-commander · Role · Incident Commander
- role:database-administrator · Role · Database Administrator
performed_by_org_unit (3)
- org-unit:platform-team · OrgUnit · Platform Team
- org-unit:infra-engineering · OrgUnit · Infrastructure Engineering
- org-unit:incident-response-team · OrgUnit · Incident Response Team
requires_skill_area (2)
- skill-area:chaos-engineering · SkillArea · Chaos Engineering
- skill-area:incident-response · SkillArea · Incident Response
triggers_responsibility (3)
- responsibility:respond-incidents · Responsibility · Respond to production incidents
- responsibility:runbook-authoring · Responsibility · Runbook authoring
- responsibility:capacity-planning · Responsibility · Capacity Planning
Incoming edges
follows_workflow (1)
- stack-profile:disaster-recovery · StackProfile · Disaster Recovery (Terraform, Kubernetes, Prometheus, PostgreSQL, S3)