NVIDIA
Machine Learning Engineer, GeForce G-Assist
Found: February 1, 2026
This role is based in Santa Clara, CA.
Compensation:
$184,000 - $356,500/year
Responsibilities:
- Evaluate and improve small language models (SLMs) for GeForce G-Assist.
- Work with SLM and vision-language model (VLM) architectures for text and multimodal interactions.
- Optimize local inference using llama.cpp and enhance C/C++ code performance.
- Design retrieval-augmented generation systems for context-aware responses.
Requirements:
- An M.S. or higher degree and 8+ years of experience in system software or a related field.
- Proficiency in C/C++ and Python, with experience in performance-sensitive environments.
- Hands-on experience with small language models and an understanding of conversation dynamics.
- Knowledge of retrieval technologies and agentic AI patterns.