Nvidia
Senior Systems Engineer, Artificial Intelligence Operations
Found: October 28, 2025
Location:
UK, Remote; Finland, Remote; France, Remote; Spain, Remote; Sweden, Remote
What you'll do:
- Understand customer requirements to enhance AI cluster resiliency and design AIOps-based solutions.
- Develop automated workflows for issue detection and root cause analysis, collaborating with operators to debug complex AI cluster problems.
- Deliver technical presentations and lead hands-on demos or training, ensuring smooth installations throughout the customer journey.
What we need to see:
- Bachelor of Science or equivalent experience.
- 12+ years of networking experience in enterprise or service provider environments.
- Proficient in scripting and automation using Python or similar languages, with strong Linux expertise.
- Experience in Systems Engineer or SRE roles, directly resolving customer issues.
- Exceptional communication skills for conveying complex technical topics.
- Ability to collaborate effectively across teams.
Ways to stand out:
- Experience with data center infrastructure and cloud architectures.
- Background in network performance monitoring or observability.
- Previous experience at a technological start-up.