Manager, Site Reliability Engineer - DGX Cloud

Nvidia logo Nvidia

📍 India, Remote

Scraped: Yesterday

Location:

India, Remote

What you'll be doing:

  • Recruit and mentor a team of Site Reliability Engineers.
  • Establish SRE practices including SLOs, SLIs, and incident management.
  • Collaborate with engineering teams to design scalable cloud services.
  • Drive automation across service lifecycle.
  • Implement monitoring and alerting solutions.
  • Oversee incident response and lead post-mortems.

What we need to see:

  • Bachelor's or Master's degree in a related field.
  • 10+ years in Site Reliability Engineering or DevOps, with 5 years in a leadership role.
  • Experience with cloud environments (AWS, GCP, Azure).
  • Expertise in Kubernetes, containerization, and microservices.
  • Strong understanding of SRE principles and infrastructure automation tools.
  • Proficiency in programming languages like Python or Go.

Fresh Big Tech Jobs in One Place

Get fresh, high-paying jobs daily straight to your email from Apple, Google, Amazon, Meta, Nvidia, Stripe, Microsoft, Netflix, Tesla, Uber, Airbnb, TikTok, Spotify, Booking.com, Pinterest, Canva, OpenAI, and others.

Why I Created Top Jobs Today

Stanislav Prigodich

Hey, I’m Stan 👋

I’m a software developer, and over time I realized I cared mostly about roles at big tech companies - not just whatever happened to show up on LinkedIn or generic job boards. But those sources weren’t enough - some roles were delayed, or never posted at all.

So I built this project to solve that. It scrapes fresh job postings directly from official company sites, figures out what kind of roles they really are, and sends them as email alerts - simple, fast, and focused.

Hope it makes your search easier too. Wishing you the best of luck - and I’m really glad you’re here!

Connect with me on LinkedIn
Reddit Join my r/FAANGJobs Community