Canva
Engineering Manager (Infra) - AI Reliability (ANZ Remote)
Found: February 10, 2026
This role is based in Brisbane, QLD, Australia with hybrid work options available.
Responsibilities:
- Building world-class AI infrastructure to support a research team of 100+ people.
- Designing and scaling multi-cloud systems for high-performance model training and inference.
- Partnering with AWS, GCP, Cloudflare, and GCore to optimize GPU compute environments.
- Enhancing CI/CD pipelines and developer velocity.
- Improving monitoring, alerting, and system observability for AI workloads.
- Driving alignment on DevOps best practices across teams.
- Leading a high-impact engineering team in a fast-paced environment.
Requirements:
- Experience leading DevOps or infrastructure teams, ideally in AI or high-performance computing.
- Familiarity with AWS and multi-cloud environments.
- Experience with Kubernetes, SLURM, or similar distributed training infrastructure.
- Fluency in infrastructure-as-code tools such as Terraform.
- Strong grasp of containerization, Linux fundamentals, and cloud networking.
About the team:
You’ll be joining CORE (Canva Original Research & Exploration), our in-house AI research lab focused on building world-class models.