Meta
Software Engineer, SystemML - Scaling / Performance
Found: Today
This role is based in Menlo Park, CA.
Compensation:
$74.04/hour to $217,000/year + bonus + equity + benefits
Responsibilities:
Enable reliable and highly scalable distributed ML training on Meta's large-scale GPU training infrastructure with a focus on GenAI/LLM scaling.
Minimum Qualifications:
- Bachelor's degree in Computer Science, Computer Engineering, or relevant technical field.
- Experience in machine learning/deep learning domains such as Distributed ML Training, GPU architecture, or ML frameworks like PyTorch.
Preferred Qualifications:
- Knowledge of GPU architectures and CUDA programming.
- Experience with deep learning frameworks like PyTorch, Caffe2, or TensorFlow.
- PhD in Computer Science or related field.