Nvidia

Engineering Manager, Deep Learning Inference

4 Locations Remote

Found: November 1, 2025

View Details and Apply

This role is based in multiple locations including Santa Clara, CA, and offers remote options.

Compensation:

$224,000 - $425,500/year

Responsibilities:

Lead and mentor an engineering team in deep learning inference.
Drive strategy and execution of NVIDIA’s inference frameworks.
Collaborate with internal teams for optimized inference pipelines.
Oversee performance tuning and optimization of AI models.
Guide engineers in best practices for CUDA and multi-GPU communications.

Requirements:

MS, PhD, or equivalent experience in Computer Science or related field.
6+ years of software development experience, including 3+ years in leadership roles.
Strong background in C/C++ and GPU programming.
Experience with deploying deep learning models in production.

Tech stack:

C/C++, Python, CUDA, Triton, CUTLASS.

View Details and Apply

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.

or use email

Hey, I'm Stan

Software Developer & Creator of Top Jobs Today

I'm a software developer, and over time I realized I cared mostly about roles at big tech companies - not just whatever happened to show up on LinkedIn or generic job boards. But those sources weren't enough - some roles were delayed, or never posted at all.

So I built this website to solve that. It scrapes fresh job postings directly from official company sites, figures out what kind of roles they really are, and sends them as email alerts - simple, fast, and focused.

Hope it makes your search easier too. Wishing you the best of luck - and I'm really glad you're here!