Nvidia

Senior DL Algorithms Engineer - Inference Performance

Canada, Toronto

Found: October 16, 2025

⚠️ This job posting is no longer active and may not be accepting applications. Browse similar live jobs below, or see all current Nvidia jobs.

This role is based in Toronto, Canada.

Compensation:

116,250 CAD - 247,000 CAD depending on level.

Responsibilities:

Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
Contribute new features, fix bugs, and deliver production code to TRT-LLM, NVIDIA’s open-source inference serving library.
Profile and analyze bottlenecks across the full inference stack to enhance performance.
Benchmark state-of-the-art offerings in various DL models inference and perform competitive analysis.
Collaborate with SW/HW co-design teams for next-gen AI-powered services.

Requirements:

PhD in CS, EE, or equivalent experience.
3+ years of experience in deep learning and neural networks.
Experience with performance profiling and optimization for GPU-based applications.
Proficient in C++, PyTorch, or equivalent frameworks.
Deep understanding of computer architecture and GPU fundamentals.

Preferred Qualifications:

Experience with processor and system-level performance optimization.
Understanding of modern LLM architectures.
Strong fundamentals in algorithms.
GPU programming experience (CUDA or OpenCL) is a plus.

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.

Apple

Google

Amazon

Meta

OpenAI

Microsoft

Nvidia

Stripe

TikTok

Netflix

Uber

Airbnb

Booking

Spotify

Canva

or use email

💰 What does this role pay?

Real advertised salary data — median, 25th & 75th percentile — for this role at big tech.

FullStack Developer salary (US big-tech median)FullStack Developer salary at Anthropic FullStack Developer salary at Openai FullStack Developer salary at Airbnb

Same role, other locations

🍎 Apple

SoC Full Chip DV Engineer

Cupertino

View Job Apply

🍎 Apple

SoC Full Chip DV Engineer

Cupertino

View Job Apply

🎬 Netflix

Full Stack Engineer 4 - User & Access Management Engineering

Los Gatos, California, United States of America and 3 more

View Job Apply

Stanislav Prigodich

Hey, I'm Stan

Software Developer & Creator of Top Jobs Today

I'm a software developer, and over time I realized I cared mostly about roles at big tech companies - not just whatever happened to show up on LinkedIn or generic job boards. But those sources weren't enough - some roles were delayed, or never posted at all.

So I built this website to solve that. It scrapes fresh job postings directly from official company sites, figures out what kind of roles they really are, and sends them as email alerts - simple, fast, and focused.

Hope it makes your search easier too. Wishing you the best of luck - and I'm really glad you're here!