Nvidia
Machine Learning Engineer, LLM Training Datasets
Found: September 16, 2025
This role is based in Santa Clara, CA or can be remote.
Compensation:
$148,000 - $287,500/year depending on level.
Responsibilities:
- Develop datasets for LLM pre-training and post-training.
- Design data strategies for model training and evaluation.
- Generate high-quality synthetic data for various use cases.
- Conduct experiments to optimize Large Language Models.
- Collaborate with ML researchers and data scientists.
Requirements:
- Master’s or PhD in Computer Science or related field.
- 3+ years of experience in developing datasets for large language models.
- Hands-on programming expertise in Python.
- Experience with machine learning frameworks like PyTorch and TensorFlow.