
Simplifying cloud data management for enterprises
Rubrik is a cloud data management company headquartered in Palo Alto, California, specializing in data backup, recovery, and security solutions. Founded in December 2013, Rubrik has raised over $1.3 billion in funding from investors like Greylock Partners and IVP, and serves over 3,000 customers, in...
Rubrik offers competitive salaries, equity options, generous PTO policies, and a flexible remote work policy to support work-life balance....
Rubrik fosters a culture of innovation and accountability, encouraging employees to take ownership of their projects and contribute to the company's m...

Rubrik • Bangalore
Rubrik is hiring a Production Engineer to manage critical infrastructure and services in multi-cloud environments. You'll work with Kubernetes, AWS, and Docker to ensure maximum uptime and reliability. This role requires hands-on experience in incident management and automation.
You have a solid understanding of distributed system concepts and practical experience working with production systems and environments, preferably within public cloud infrastructures. Your familiarity with container orchestration platforms, especially Kubernetes, allows you to effectively manage and optimize services in multi-cloud environments. You demonstrate strong decision-making skills under pressure, effectively managing critical situations with urgency and composure. Your experience in incident management has equipped you with the ability to lead teams in swiftly responding to alerts and outages, ensuring timely resolution of issues. You are driven by a desire for continuous improvement and automation, always looking for ways to enhance system resilience and reduce toil.
Experience with observability solutions for real-time monitoring, alerting, and metrics collection is a plus. Familiarity with automation tools for detecting, triaging, and remediating production issues will set you apart. You thrive in a collaborative environment and are eager to contribute to a 24/7 Production Operations team.
As a Production Engineer at Rubrik, you will join a dedicated team responsible for managing and supporting critical infrastructure and services across multi-cloud environments. You will oversee staging and production environments to ensure maximum uptime and reliability, implementing and maintaining comprehensive observability solutions for real-time monitoring and alerting. Your role will involve leading incident management efforts, coordinating teams to drive timely resolution of outages, and analyzing recurring incidents to identify root causes. You will design and develop automation tools to proactively detect and remediate production issues, maintaining and updating runbooks to support incident response and recurring issues. Your contributions will directly impact the operational excellence of Rubrik's services, ensuring that critical systems remain available and resilient.
Rubrik provides a dynamic work environment where you can grow your skills and make a significant impact. You will have the opportunity to work with cutting-edge technologies in a collaborative team setting. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds. Join us in our mission to deliver exceptional service reliability and operational excellence.
Apply now or save it for later. Get alerts for similar jobs at Rubrik.