OpenAI

Inference Runtime, Engineering Manager

San Francisco

Found: March 3, 2026

Location:

San Francisco

Compensation:

$455K – $555K/year

Responsibilities:

  • Lead a team of engineers specializing in distributed systems and model architecture.
  • Collaborate with machine learning researchers and product managers to deploy technologies.
  • Improve performance, latency, and efficiency of the model inference stack.
  • Optimize code and GPU fleet for maximum resource utilization.

Requirements:

  • 15+ years of professional software engineering experience.
  • Familiarity with PyTorch, NVidia GPUs, and HPC technologies.
  • Experience in architecting and debugging production distributed systems.

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.

Apple Google Amazon Meta OpenAI Microsoft Nvidia Stripe TikTok Netflix Uber Airbnb Booking Spotify Canva Pinterest
or use email

Similar Big Tech Jobs - Posted in the Past 24h

📌 Pinterest

Engineering Manager, Big Data Storage

Palo Alto + 1 other locations
🔍 Google

Software Engineering Manager II, Storage, Google Distributed Cloud

place Raleigh, NC, USA ; Durham, NC, USA
🔍 Google

Engineering Manager, Marketplace Serving

place Mountain View, CA, USA
Stanislav Prigodich

Hey, I'm Stan

Software Developer & Creator of Top Jobs Today

I'm a software developer, and over time I realized I cared mostly about roles at big tech companies - not just whatever happened to show up on LinkedIn or generic job boards. But those sources weren't enough - some roles were delayed, or never posted at all.

So I built this website to solve that. It scrapes fresh job postings directly from official company sites, figures out what kind of roles they really are, and sends them as email alerts - simple, fast, and focused.

Hope it makes your search easier too. Wishing you the best of luck - and I'm really glad you're here!