Nvidia
Multimodal Deep Learning Solution Architect - Vision Language and Action Models
Found: October 2, 2025
This role is based in multiple remote locations including France, Poland, Spain, Switzerland, and Germany.
What you'll do:
- Serve as the primary technical expert between NVIDIA and customers, providing AI solutions and guidance.
- Build proof-of-concepts and demonstrations for Vision Language Reasoning Models.
- Collaborate with developers, researchers, and executives to integrate NVIDIA technology.
- Work with Engineering, Product, and Sales teams to develop suitable solutions based on customer feedback.
What we need to see:
- MS/PhD or equivalent experience in relevant fields.
- Deep expertise in AI/Deep Learning, with hands-on experience in training VLMs.
- Experience with deep learning frameworks (e.g., PyTorch, Nemo) and optimization methods (e.g., TensorRT).
- 5+ years of experience in software development (Python/C++).
- Strong communication skills in English.
Ways to Stand Out:
- Familiarity with Cosmos-Reason and Isaac GR00T.
- Experience in large scale training and customization of VLM.
- Track record in Neural Networks inference optimization for Physical AI use cases.