Nvidia

Engineering Manager, Deep Learning Inference

4 Locations

Found: November 1, 2025

⚠️ This job posting is no longer active and may not be accepting applications. Browse similar live jobs below, or see all current Nvidia jobs.

This role is based in multiple locations including Santa Clara, CA and offers remote options.

Compensation:

$224,000 - $425,500/year

Responsibilities:

Lead, mentor, and scale a high-performing engineering team focused on deep learning inference and GPU-accelerated software.
Drive the strategy, roadmap, and execution of NVIDIA’s inference frameworks engineering.
Partner with internal teams to deliver optimized inference pipelines across NVIDIA accelerators.
Oversee performance tuning and optimization of large-scale models for AI applications.
Guide engineers in adopting best practices for CUDA, Triton, and multi-GPU communications.

Requirements:

MS, PhD, or equivalent experience in Computer Science or related field.
6+ years of software development experience, including 3+ years in technical leadership.
Strong background in C/C++ software design and development; proficiency in Python is a plus.
Hands-on experience with GPU programming and performance optimization.
Proven record of deploying or optimizing deep learning models in production environments.

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.