Nvidia

Multimodal Deep Learning Solution Architect - Vision Language and Action Models

Germany, Munich

Found: December 19, 2025

View Details and Apply

This role is based in Munich, Germany.

Responsibilities:

Serve as the primary technical expert between NVIDIA and customers, providing AI solutions and guidance.
Build proof-of-concepts and demonstrations for Vision Language Reasoning Models.
Collaborate with developers, researchers, and executives to integrate NVIDIA technology.
Work with Engineering, Product, and Sales teams to develop suitable solutions based on customer feedback.

Requirements:

MS/PhD or equivalent experience in relevant fields.
Deep expertise in AI/Deep Learning and experience with VLMs.
Proficiency in deep learning frameworks (e.g., PyTorch, Nemo) and optimization methods (e.g., TensorRT).
Excellent communication and presentation skills in English.
5+ years of experience in software development (Python/C++).

Ways to Stand Out:

Familiarity with Cosmos-Reason and Isaac GR00T.
Experience in large-scale training and customization of VLM.
Track record in Neural Networks inference optimization for Physical AI use cases.

View Details and Apply

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.