OpenAI
Data Center Incident Program Manager
Found: Today
This role is remote-based in the US.
Compensation:
$125.6K – $228K
Responsibilities:
- Define and maintain incident severity levels and escalation thresholds.
- Establish incident response standards and protocols.
- Implement incident management tooling and ensure integrations with monitoring systems.
- Lead post-incident reviews and track corrective actions.
Requirements:
- 7+ years in mission-critical infrastructure or data center operations.
- Experience leading major incidents and conducting root cause analysis.
- Ability to remain calm and decisive under pressure.
Preferred Skills:
- Experience in hyperscale AI compute environments.
- Familiarity with incident management tools like PagerDuty or ServiceNow.