Nvidia

Senior Deep Learning Software Engineer, Inference and Model Optimization

2 Locations Remote

Found: October 17, 2025

⚠️ This job posting is no longer active and may not be accepting applications. Browse similar live jobs below, or see all current Nvidia jobs.

This role is based in Santa Clara, CA or remote.

Compensation:

$148,000 - $287,500/year

Responsibilities:

Train, develop, and deploy state-of-the-art generative AI models.
Leverage the torch 2.0 ecosystem for automated deployment solutions.
Develop high-performance optimization techniques for inference.
Collaborate with teams across NVIDIA for kernel implementations.
Analyze GPU kernel-level performance for optimization opportunities.
Innovate on inference performance to maintain market leadership.
Architect and design a modular software platform for user experience.

Requirements:

Masters, PhD, or equivalent experience in Computer Science, AI, Applied Math, or related field.
3+ years of relevant work or research experience in Deep Learning.
Excellent software design skills and proficiency in Python, PyTorch, and related ML tools.
Strong algorithms and programming fundamentals.
Good communication skills and ability to work in a fast-paced environment.

Tech stack:

Python, PyTorch, CUDA, TRT, Triton, HuggingFace.

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.

Apple

Google

Amazon

Meta

OpenAI

Microsoft

Nvidia

Stripe

TikTok

Netflix

Uber

Airbnb

Booking

Spotify

Canva

or use email

💰 What does this role pay?

Real advertised salary data — median, 25th & 75th percentile — for this role at big tech.

AI/ML Engineer salary (US big-tech median)AI/ML Engineer salary at Pinterest AI/ML Engineer salary at Reddit AI/ML Engineer salary at Airbnb

Same role, other locations

🎶 Spotify

Staff Machine Learning Engineer, Personalization

New York

View Job Apply

Stanislav Prigodich

Hey, I'm Stan

Software Developer & Creator of Top Jobs Today

I'm a software developer, and over time I realized I cared mostly about roles at big tech companies - not just whatever happened to show up on LinkedIn or generic job boards. But those sources weren't enough - some roles were delayed, or never posted at all.

So I built this website to solve that. It scrapes fresh job postings directly from official company sites, figures out what kind of roles they really are, and sends them as email alerts - simple, fast, and focused.

Hope it makes your search easier too. Wishing you the best of luck - and I'm really glad you're here!