Canva
Engineering Manager (Infra) - AI Reliability (Sydney based)
Found: Today
This role is based in Sydney, Australia with hybrid work options available.
Responsibilities:
- Building world-class AI infrastructure to support a 100+ person research team.
- Designing and scaling multi-cloud systems for model training and inference.
- Enhancing CI/CD pipelines and developer velocity.
- Improving monitoring and system observability for AI workloads.
- Driving alignment in DevOps best practices.
- Leading a high-impact engineering team.
Requirements:
- Experience leading DevOps or infrastructure teams, ideally in AI or high-performance computing.
- Familiarity with AWS, GCP, Cloudflare, or GCore.
- Experience with Kubernetes, SLURM, or similar infrastructure.
- Fluency in infrastructure as code tools like Terraform.
- Strong grasp of containerisation, Linux fundamentals, and cloud networking.
About the team:
You’ll be joining CORE (Canva Original Research & Exploration), our in-house AI research lab.