
Empowering learners through accessible online education
Udemy is a leading online learning platform headquartered in San Francisco, California, offering over 130,000 courses to a global community of 35 million students. The platform provides a diverse range of subjects including programming, marketing, and data science, catering to both individual learne...
Udemy offers competitive salaries, equity options, generous PTO policies, and a remote work flexibility that allows employees to balance their work an...
Udemy fosters a culture of continuous learning and innovation, encouraging employees to enhance their skills through access to their own courses and a...

Udemy • Dublin, Ireland
Udemy is hiring a Staff Site Reliability Engineer to manage and evolve their infrastructure. You'll work with AWS, Kubernetes, and programming languages like Python and Golang. This role requires extensive knowledge of cloud technologies and infrastructure-as-code tools.
You have extensive knowledge of cloud technologies, with AWS experience being highly advantageous. Your proven expertise in managing containerized workloads using Kubernetes in production environments sets you apart. You are proficient in programming languages such as Python, Golang, or Kotlin, and have a strong familiarity with infrastructure-as-code (IaC) tools like Terraform and Helm. You thrive in collaborative environments and are eager to enhance reliability standards across the organization.
Experience with incident response and driving best practices in reliability is a plus. Familiarity with CI/CD pipelines and monitoring tools will help you excel in this role. You are a proactive problem solver who enjoys optimizing infrastructure and tooling to empower engineering teams.
As a Staff Site Reliability Engineer at Udemy, you will play a critical role in managing and evolving our infrastructure, from our CDN to our databases. You will oversee and improve tools like Helm and Terraform, building development environments that empower our engineering teams. Collaborating closely with development teams, you will design internal tools in Python and Golang while responding to incidents and driving best practices in reliability. You will lead projects to enhance and optimize our infrastructure and tooling, ensuring that our systems are robust and scalable. Your work will directly impact the learning experience of millions of users worldwide, making it essential to maintain high reliability standards.
At Udemy, we are committed to transforming lives through learning. You will be part of a mission-driven team that values innovation and collaboration. We offer competitive compensation and benefits, along with opportunities for professional growth and development. You will work in a supportive environment that encourages you to share your unique experiences and perspectives. Join us in shaping the future of learning and making a real impact on people's lives around the world.
Apply now or save it for later. Get alerts for similar jobs at Udemy.