Nvidia

Senior DL Algorithms Engineer - Inference Performance

Canada, Toronto

Found: October 16, 2025

This role is based in Toronto, Canada.

Compensation:

116,250 CAD - 247,000 CAD depending on level.

Responsibilities:

  • Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
  • Contribute new features, fix bugs, and deliver production code to TensorRT-LLM (TRT-LLM), NVIDIA’s open-source inference serving library.
  • Profile and analyze bottlenecks across the full inference stack to enhance performance.
  • Benchmark state-of-the-art inference offerings across various DL models and perform competitive analysis.
  • Collaborate with SW/HW co-design teams for next-gen AI-powered services.

Requirements:

  • PhD in CS, EE, or equivalent experience.
  • 3+ years of experience in deep learning and neural networks.
  • Experience with performance profiling and optimization for GPU-based applications.
  • Proficiency in C++, PyTorch, or equivalent frameworks.
  • Deep understanding of computer architecture and GPU fundamentals.

Preferred Qualifications:

  • Experience with processor and system-level performance optimization.
  • Understanding of modern LLM architectures.
  • Strong fundamentals in algorithms.
  • GPU programming experience (CUDA or OpenCL) is a plus.
