
About Roblox

Empowering creators in a vibrant gaming universe

🏢 Tech, Gaming | 👥 1001+ employees | 📅 Founded 2006 | 📍 South San Mateo, San Mateo, CA | 💰 $922.8m | ⭐ 3.8
B2C, Gaming, Entertainment, Community

Key Highlights

  • Over 200 million monthly active users globally
  • More than $500 million paid to developers in 2022
  • Headquartered in South San Mateo, CA
  • $922.8 million raised in Series G funding

Roblox is an online gaming and entertainment platform headquartered in South San Mateo, CA, that connects over 200 million monthly active users. The platform empowers its community to create and monetize their own games, with over $500 million paid out to developers in 2022 alone. As a leader in the...

🎁 Benefits

Roblox offers competitive salaries, equity options, generous PTO policies, and a flexible remote work policy to support work-life balance. Employees a...

🌟 Culture

Roblox fosters a creator-centric culture, encouraging employees to innovate and collaborate while prioritizing user safety. The company values communi...


Principal Site Reliability Operations Engineer

Roblox • San Mateo, CA, United States

Posted 2w ago | 🏛️ On-Site | Principal | Site reliability engineer | 📍 San Mateo

Skills & Technologies

AWS, Docker, Kubernetes, Linux, Incident management

Job Description

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.

At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. 

A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

As a Principal Site Reliability Operations Engineer on the Reliability Team, you will manage production incidents and improve Roblox's incident processes. You will maintain reliability service-level objectives, drive incidents tenaciously to resolution, and work with service teams towards appropriate action items during the incident postmortem process. If you are passionate about maintaining uptime in a complex distributed environment full of continuous change, you'll be right at home with our Reliability team. You will report to the Senior Manager, Reliability Response.

You Will:

  • Lead and manage production incidents.
  • Collaborate cross-functionally to troubleshoot and resolve sophisticated technical challenges.
  • Guide the implementation of incident management processes and procedures, ensuring fast and effective responses to minimize impact.
  • Continually monitor system health, performance and capacity, proactively addressing potential issues.
  • Conduct comprehensive post-mortem analysis to ascertain the root cause of incidents and formulate corrective measures.
  • Contribute substantially to the design and enhancement of system architecture to boost reliability and performance.
  • Leverage coding skills to automate daily routine tasks and enhance system efficiency.
  • Serve in the Incident Manager On-Call rotation.
  • Mentor junior team members.

You Have:

  • At least 8 years of experience in a comparable role within a Site Reliability Team.
  • Advanced knowledge of systems and network infrastructure protocols.
  • Demonstrated ability in managing, troubleshooting, and resolving incidents in distributed environments.
  • Experience solving problems.
  • An ability to distill complex technical issues into clear and concise language.
  • Familiarity with at least one scripting or programming language to automate routine tasks (Python, Golang, or similar languages preferred).
  • Bachelor's degree or equivalent experience in Computer Science, Computer Engineering, or a similar technical field.

You Are:

  • A great communicator; you are able to explain complex systems clearly to stakeholders and fellow engineers.
  • Able to operate in potentially ambiguous circumstances during a production incident.
  • Familiar with the interactions of services in a distributed system.
  • Tenacious towards driving challenging production incidents to resolution.

For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on this page.

Annual Salary Range
$229,850 to $266,080 USD

Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).

Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations to candidates with qualifying disabilities or religious beliefs during the recruiting process.
