Nvidia

Multimodal Deep Learning Solution Architect - Vision Language and Action Models

6 Locations

Found: October 2, 2025

⚠️ This job posting is no longer active and may not be accepting applications. Browse similar live jobs below, or see all current Nvidia jobs.

This role is based remotely with options across multiple locations: France, Poland, Spain, Switzerland, and Germany.

Responsibilities:

Serve as the primary technical expert between NVIDIA and customers, providing AI solutions and guidance.
Build proof-of-concepts and demonstrations for Vision Language Reasoning Models.
Collaborate with developers, researchers, and executives to integrate NVIDIA technology.
Work with Engineering, Product, and Sales teams to develop suitable solutions based on customer feedback.

Requirements:

MS/PhD or equivalent in Computer Science, Data Science, or related fields.
Deep expertise in AI/Deep Learning, particularly in training or optimizing Vision Language Models.
Experience with deep learning frameworks (e.g., PyTorch, Nemo) and optimization tools (e.g., TensorRT).
5+ years of experience in software development with Python/C++.
Strong communication and collaboration skills.

Preferred Qualifications:

Familiarity with Cosmos-Reason and Isaac GR00T.
Experience in large-scale training of Vision Language Models.
Track record in Neural Networks inference optimization for Physical AI use cases.

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.