NVIDIA
Engineering Manager, Deep Learning Inference
Found: November 1, 2025
This role is based in multiple locations, including Santa Clara, CA, and offers remote options.
Compensation:
$224,000 - $425,500/year
Responsibilities:
- Lead, mentor, and scale a high-performing engineering team focused on deep learning inference and GPU-accelerated software.
- Drive the strategy, roadmap, and execution of NVIDIA’s inference frameworks engineering.
- Partner with internal teams to deliver optimized inference pipelines across NVIDIA accelerators.
- Oversee performance tuning and optimization of large-scale models for AI applications.
- Guide engineers in adopting best practices for CUDA, Triton, and multi-GPU communications.
Requirements:
- MS, PhD, or equivalent experience in Computer Science or a related field.
- 6+ years of software development experience, including 3+ years in technical leadership.
- Strong background in C/C++ software design and development; proficiency in Python is a plus.
- Hands-on experience with GPU programming and performance optimization.
- Proven record of deploying or optimizing deep learning models in production environments.