Nvidia
Engineering Manager, Deep Learning Inference
Found: November 1, 2025
This role is based in multiple locations including Santa Clara, CA, and offers remote options.
Compensation:
$224,000 - $425,500/year
Responsibilities:
- Lead and mentor an engineering team in deep learning inference.
- Drive strategy and execution of NVIDIA’s inference frameworks.
- Collaborate with internal teams for optimized inference pipelines.
- Oversee performance tuning and optimization of AI models.
- Guide engineers in best practices for CUDA and multi-GPU communications.
Requirements:
- MS, PhD, or equivalent experience in Computer Science or related field.
- 6+ years of software development experience, including 3+ years in leadership roles.
- Strong background in C/C++ and GPU programming.
- Experience with deploying deep learning models in production.
Tech stack:
C/C++, Python, CUDA, Triton, CUTLASS.