Nvidia
Senior Deep Learning Software Engineer, Inference and Model Optimization
Found: October 17, 2025
This role is based in Santa Clara, CA or remote.
Compensation:
$148,000 - $287,500/year
Responsibilities:
- Train, develop, and deploy state-of-the-art generative AI models.
- Leverage the torch 2.0 ecosystem for automated deployment solutions.
- Develop high-performance optimization techniques for inference.
- Collaborate with teams across NVIDIA for kernel implementations.
- Analyze GPU kernel-level performance for optimization opportunities.
- Innovate on inference performance to maintain market leadership.
- Architect and design a modular software platform for user experience.
Requirements:
- Masters, PhD, or equivalent experience in Computer Science, AI, Applied Math, or related field.
- 3+ years of relevant work or research experience in Deep Learning.
- Excellent software design skills and proficiency in Python, PyTorch, and related ML tools.
- Strong algorithms and programming fundamentals.
- Good communication skills and ability to work in a fast-paced environment.
Tech stack:
Python, PyTorch, CUDA, TRT, Triton, HuggingFace.