Nvidia
Principal Product Manager
Found: Today
This role is based in multiple locations: Santa Clara, CA; New York, NY; and Seattle, WA.
Compensation:
$240,000 - $379,500/year
Responsibilities:
- Manage the strategic direction and roadmap of the break-fix automation system.
- Define automation confidence thresholds and human-in-the-loop intervention points.
- Build operator UX for repair queues and workflow transparency.
- Drive integration between failure attribution and automated repair actions.
- Define repair SLOs and own metrics for fleet availability.
- Collaborate with NCP operators and hardware vendor partners.
Requirements:
- 15+ years of product management experience in infrastructure, platform, or MLOps.
- BS or MS in Computer Science, Engineering, or related area.
- Expertise with distributed systems and workflow orchestration.
- Strong operator UX instincts and ability to build alignment across teams.
Ways to Stand Out:
- Hands-on experience with GPU infrastructure or AI factory environments.
- Experience with RMA logistics and hardware repair processes.
- Background in reliability engineering or chaos testing.