Nvidia

Senior Software Development Engineer in Test

US, CA, Santa Clara

Found: Today

We are seeking a highly skilled and hard-working Senior Test Developer / test engineer to join our multifaceted Enterprise Software QA team. This role offers an outstanding opportunity to leave your mark on the design, construction, optimization and testing of large-scale infrastructure for various foundational NVIDIA unified cloud services and data center offerings. If you are a dedicated engineer with strong expertise in cloud infrastructure and distributed systems and want to apply your skills with AI tools, this role could fit you perfectly. You will thrive in an exciting, innovative environment.

What you'll be doing:

  • Work with development teams on test plans for all layers of SW stack for cloud infrastructure, execution, reviews, failure analysis and assessing overall quality and risk. Work with customer PMs on software issues including technical feedback from OEMs and CSPs. Develop key benchmarks to track execution and deploy process improvements to improve efficiency

  • Leverage AI skills to expedite the test scope, test plan, execution and automation workflows.

  • Lead NVIDIA Cloud and Data Center bring up activities which will involve validation, reporting, working with engineering to debug issues, providing design input at times, adding coverage in different areas.

  • Design, develop and maintain CI/CD pipelines for continuous testing in cloud environments when needed.

  • Perform performance, scalability, and reliability testing of cloud services.

  • Implement and maintain test environments in cloud platforms such as AWS, Azure, or Google Cloud.

  • Supervise the infrastructure to alert on significant events, ensuring the highest level of system performance and reliability.

  • Work with various different partner teams to ensure availability of clusters to test on and take the lead in resolve all issues.

  • Working with teams to ensure quality of the cloud products getting delivered focusing on critical areas like security, storage, workloads, performance on latest SW and FW components.

What we need to see:

  • A Master's or Ph.D. in Computer Science or a related field, or equivalent experience.

  • Experience with AI development tools used in creating test cases, automating test cases, code coverage, triaging.

  • 8+ years of hands-on experience in cluster management and related tools, including Docker Containers, Slurm, Kubernetes, and Ansible.

  • 2+ years strong experience with cloud infrastructure platforms like AWS, Azure, Google, OCI Cloud.

  • Hands-on experience with network, storage, security, cluster configuration and debugging, cloud infrastructure management tools like terraform, ansible.

  • Expertise in administering, operating, and configuring Kubernetes.

  • Experience in CI/CD tools such as Gitlab and Jenkins and the GitOps model.

  • Proficiency in various monitoring tools :Prometheus, Grafana, Cloudwatch, and Thanos.

  • Proficiency in debugging issues involving networks, DHCP, DNS, HTTP, Linux, and containers.

Ways to Stand Out from the Crowd:

  • Familiarity with "Base Command Manager" for managing and monitoring high performance computing.

  • Experience in writing automation for web application using tools like selenium, playwright.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 270,250 USD.

Get jobs like this in your inbox daily

Fresh FAANG jobs, every day, filtered for your role and location.

Apple Google Amazon Meta OpenAI Microsoft Nvidia Stripe TikTok Netflix Uber Airbnb Booking Spotify Canva Pinterest
or use email

Similar Big Tech Jobs - Posted in the Past 24h

🍎 Apple

Quality & Reliability Engineering

Cupertino
🔍 Google

System Level Test Product Owner, Google Cloud

place Sunnyvale, CA, USA
🎮 Nvidia

Senior QA Software Engineer - Networking

US, CA, Santa Clara
Stanislav Prigodich

Hey, I'm Stan

Software Developer & Creator of Top Jobs Today

I'm a software developer, and over time I realized I cared mostly about roles at big tech companies - not just whatever happened to show up on LinkedIn or generic job boards. But those sources weren't enough - some roles were delayed, or never posted at all.

So I built this website to solve that. It scrapes fresh job postings directly from official company sites, figures out what kind of roles they really are, and sends them as email alerts - simple, fast, and focused.

Hope it makes your search easier too. Wishing you the best of luck - and I'm really glad you're here!