Nvidia
Multimodal Deep Learning Solution Architect - Vision Language and Action Models
Found: December 19, 2025
This role is based in Munich, Germany.
Responsibilities:
- Serve as the primary technical expert between NVIDIA and customers, providing AI solutions and guidance.
- Build proof-of-concepts and demonstrations for Vision Language Reasoning Models.
- Collaborate with developers, researchers, and executives to integrate NVIDIA technology.
- Work with Engineering, Product, and Sales teams to develop suitable solutions based on customer feedback.
Requirements:
- MS/PhD or equivalent experience in relevant fields.
- Deep expertise in AI/Deep Learning and experience with VLMs.
- Proficiency in deep learning frameworks (e.g., PyTorch, Nemo) and optimization methods (e.g., TensorRT).
- Excellent communication and presentation skills in English.
- 5+ years of experience in software development (Python/C++).
Ways to Stand Out:
- Familiarity with Cosmos-Reason and Isaac GR00T.
- Experience in large-scale training and customization of VLM.
- Track record in Neural Networks inference optimization for Physical AI use cases.