Nvidia
Multimodal Deep Learning Solution Architect - Vision Language and Action Models
Found: October 2, 2025
This role is based remotely with options across multiple locations: France, Poland, Spain, Switzerland, and Germany.
Responsibilities:
- Serve as the primary technical expert between NVIDIA and customers, providing AI solutions and guidance.
- Build proof-of-concepts and demonstrations for Vision Language Reasoning Models.
- Collaborate with developers, researchers, and executives to integrate NVIDIA technology.
- Work with Engineering, Product, and Sales teams to develop suitable solutions based on customer feedback.
Requirements:
- MS/PhD or equivalent in Computer Science, Data Science, or related fields.
- Deep expertise in AI/Deep Learning, particularly in training or optimizing Vision Language Models.
- Experience with deep learning frameworks (e.g., PyTorch, Nemo) and optimization tools (e.g., TensorRT).
- 5+ years of experience in software development with Python/C++.
- Strong communication and collaboration skills.
Preferred Qualifications:
- Familiarity with Cosmos-Reason and Isaac GR00T.
- Experience in large-scale training of Vision Language Models.
- Track record in Neural Networks inference optimization for Physical AI use cases.