Category: Operational Resilience Author: Cody Swidler Tags: incident command, ICS, crisis management, operational resilience, incident response, command structure
Resilience programs spend enormous effort on the things you can prepare in advance — business impact analyses, recovery objectives, runbooks, dependency maps. All of it matters. But none of it runs itself. When an incident hits, every one of those artifacts has to be picked up and used by people, under pressure, with incomplete information, often at three in the morning. The thing that determines whether that happens well is your incident command structure. And it's the piece most programs underinvest in.
You can have flawless recovery plans and still fail in the first thirty minutes because no one knew who was in charge, three people were making conflicting decisions, and the bridge call had forty people on mute and no one driving. Incident command is the backbone of operational resilience because it's the system that converts all your preparation into a coordinated response.
What Incident Command Actually Is
Incident command — borrowed and adapted from the emergency-services Incident Command System — is a defined structure for who leads, who does what, and how decisions and information flow during a disruption. Its core insight is simple and powerful: in a crisis, you don't want to invent the org chart while the building is on fire. You establish the roles in advance and activate them on demand.
The central role is the Incident Commander — one person, clearly designated, who owns the response. Not the most senior person in the room by default, and not a committee. The IC's job isn't to fix the technical problem; it's to run the response: set priorities, make the calls that need making, manage the flow of information, and decide when to escalate. Around the IC sit defined functions — technical lead, communications, scribe, liaison to leadership — each with a clear remit so the work doesn't collide.
Why This Is a Resilience Function, Not Just an IT One
Most organizations have some version of incident response inside engineering — an on-call rotation, a sev1 process, a PagerDuty escalation. That's necessary, but it's scoped to technical incidents. Operational resilience is broader: a disruption might be a vendor outage, a physical-site loss, a fraud event, a regulatory action, or a cyber incident with legal and customer-notification dimensions. Those need command, coordination, and communication that reach well past the engineering team.
This is the same theme I keep coming back to in operational resilience is an ownership problem: the frameworks tell you to have a response capability, but capability lives in clear ownership and practiced roles, not in a document. Incident command is where that ownership becomes concrete. It names who runs the response before you need to know.
It's also increasingly a regulatory expectation. Regimes like DORA care not just about whether you can recover, but whether you can manage and communicate through an incident — to leadership, to customers, to regulators — on defined timelines. That's a command-and-communication capability, and it cuts across the whole organization, not just ICT.
The Tiers: Tactical, Operational, Strategic
Mature incident command separates three altitudes of response, and conflating them is a common failure. The tactical tier is the hands on keyboards fixing the problem. The operational tier — the incident commander and core functions — coordinates the response, removes blockers, and manages information. The strategic tier — executives and crisis management — handles enterprise-level decisions: major spend, regulatory notification, public statements, decisions with legal or reputational weight.
When these tiers blur, you get the classic failure modes: executives diving into technical troubleshooting they're not equipped for, or engineers being pulled into customer-communication decisions instead of fixing the issue. A clear command structure keeps each tier doing its own job and defines the escalation triggers between them — so the strategic tier gets pulled in when it's actually needed, not too early and not too late.
You Don't Have a Capability Until You've Tested It
Here's the uncomfortable truth: an incident command structure that exists only on paper does not exist. The first time people run the roles cannot be during a real Sev1. Command is a skill, and skills decay. The IC has to have practiced making decisions with incomplete information. The comms lead has to have drafted a holding statement under time pressure. The handoffs between tiers have to have been rehearsed.
That's what tabletop exercises are for — not to check a compliance box, but to build the muscle memory and surface the gaps while the stakes are simulated. A good tabletop exercise playbook puts the command structure under realistic pressure: ambiguous information, competing priorities, a clock running, and decisions that have to be made by the people who'll actually make them in a real event. The gaps you find in a two-hour exercise are the ones you won't discover at three in the morning.
Where to Start
If your incident response today is "the senior engineer who happens to be online runs it," you have a tactical capability and no command structure. Start by designating the Incident Commander role and writing down what it owns. Define the core functions around it. Establish the three tiers and the escalation triggers between them. Then run a tabletop and watch where it breaks — because it will, and that's the point.
Recovery plans tell you what to restore. Incident command determines whether anyone actually does it in a coordinated way when it counts. It's the backbone — invest in it like one.
Cody Swidler is the founder of PivotRisk and a Principal Program Manager, Enterprise Resiliency at Apex Clearing. He has built and scaled GRC, resilience, and risk programs across Microsoft, Twilio, Box, Zayo, and Miro.
Test your command structure before you need it
The Tabletop Exercise Playbook gives you ready-to-run scenarios that put your incident command roles, escalation triggers, and decision-making under realistic pressure.
Get the Tabletop Playbook