NVIDIA
Multimodal Deep Learning Solution Architect - Vision Language and Action Models
Found: October 2, 2025
This role is based remotely in multiple locations across Europe including France, Poland, Spain, Switzerland, and Germany.
What you'll do:
- Serve as the primary technical expert for NVIDIA customers, providing guidance on AI solutions and training processes.
- Build proof-of-concepts showcasing NVIDIA AI platforms for Vision Language Reasoning Models.
- Collaborate with developers, researchers, and executives to integrate NVIDIA technology effectively.
- Work with Engineering, Product, and Sales teams to develop solutions based on customer feedback.
What we need to see:
- MS/PhD in Computer Science, Data Science, or related fields.
- Deep expertise in AI/Deep Learning and experience with training or optimizing Vision-Language Models.
- Familiarity with deep learning frameworks (e.g., PyTorch, NeMo) and optimization tools (e.g., TensorRT).
- 5+ years of experience with Python/C++, plus strong communication skills.
Ways to Stand Out:
- Experience with Cosmos-Reason and Isaac GR00T.
- Track record in large-scale training and customization of Vision-Language Models.
- Experience with neural network inference optimization for Physical AI use cases.