Nvidia

Senior Production Engineer - DGX Cloud

6 Locations

Found: Today

This role is remote with multiple locations available including CA, NC, TX, CO, and WA.

Compensation:

$168,000 - $333,500/year based on experience and level.

Responsibilities:

  • Work on production systems for scalable GPU clusters for AI workloads.
  • Implement monitoring and health management for GPU assets.
  • Collaborate with teams to ensure reliable AI cluster performance.

Requirements:

  • 8+ years in Production Engineering/DevOps/SRE roles.
  • Experience with large-scale production systems.
  • BS in Computer Science, Engineering, or related field.
  • Proficient in systems programming languages (Go, Python).

Tech stack:

GPU, Kubernetes, Slurm, Bright Cluster Manager.

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.

Apple Google Amazon Meta OpenAI Microsoft Nvidia Stripe TikTok Netflix Uber Airbnb Booking Spotify Canva Pinterest
or use email
Stanislav Prigodich

Hey, I'm Stan

Software Developer & Creator of Top Jobs Today

I'm a software developer, and over time I realized I cared mostly about roles at big tech companies - not just whatever happened to show up on LinkedIn or generic job boards. But those sources weren't enough - some roles were delayed, or never posted at all.

So I built this website to solve that. It scrapes fresh job postings directly from official company sites, figures out what kind of roles they really are, and sends them as email alerts - simple, fast, and focused.

Hope it makes your search easier too. Wishing you the best of luck - and I'm really glad you're here!