Nvidia
Multimodal Deep Learning Solution Architect - Vision Language and Action Models
Found: October 2, 2025
This role is based remotely across multiple locations in Europe: France, Poland, Spain, Switzerland, and Germany.
What you'll do:
- Serve as the primary technical expert between NVIDIA and customers, providing AI solutions and guidance.
- Build proof-of-concepts and demonstrations for Vision Language Reasoning Models.
- Collaborate with developers, researchers, and IT professionals to integrate NVIDIA technology.
- Work with Engineering, Product, and Sales teams to develop suitable solutions based on customer feedback.
What we need to see:
- MS/PhD or equivalent experience in relevant fields.
- Deep expertise in AI/Deep Learning with hands-on experience in training or optimizing Vision Language Models.
- Experience with deep learning frameworks (e.g., PyTorch, Nemo) and optimization tools (e.g., TensorRT, Triton Inference Server).
- Excellent communication skills in English.
- 5+ years of experience in software development (Python/C++).
Ways to Stand Out:
- Familiarity with Cosmos-Reason and Isaac GR00T.
- Experience in large scale training and customization of Vision Language Models.
- Experience in Neural Networks inference optimization for Physical AI use cases.