
Empowering the world through technology and information
Google LLC, headquartered in Mountain View, California, is a global leader in internet-related services and products, including its flagship search engine, Google Search, and the Android operating system. With over 100,000 employees, Google also offers cloud computing services through Google Cloud P...
Google offers competitive salaries, equity options, generous PTO policies, comprehensive health benefits, and a remote work policy that allows flexibi...
Google is known for its engineering-first culture, emphasizing innovation and collaboration. The company fosters a unique environment that encourages ...

Google • Sydney NSW, Australia
Google is seeking a Senior Site Reliability Engineer to advance reliability across the Android Ecosystem. You'll apply your expertise in software development and systems engineering to improve service uptime and performance. This role requires 5+ years of experience in software development and a strong background in testing and maintaining software products.
You have a Bachelor's degree in Computer Science or a related technical field, or equivalent practical experience. With at least 5 years of experience in software development across various programming languages, you have a solid foundation in coding and software design. Your 3 years of experience in testing, maintaining, or launching software products has equipped you with the skills to ensure high reliability and performance in large-scale systems. You also possess at least 1 year of experience with software design and architecture, allowing you to contribute effectively to complex projects. A Master's degree in Computer Science or a related field is preferred.
Your expertise in Site Reliability Engineering (SRE) combines software and systems engineering, enabling you to build and run large-scale, massively distributed, fault-tolerant systems. You understand the importance of ensuring that services have the reliability and uptime that users expect, and you are driven to improve these metrics continuously. You thrive in a culture of intellectual curiosity and problem-solving, and you appreciate working in a collaborative environment where diverse perspectives are valued. You are motivated by the opportunity to manage complex challenges unique to Google, utilizing your skills in coding, algorithms, and large-scale system design.
You are eager to explore new frontiers in Site Reliability Engineering, particularly in advancing reliability for the Android Ecosystem on end-user devices. Your experience in designing and implementing tooling and services for incident response will be invaluable in this role. You are comfortable participating in on-call rotations and are ready to take on the responsibility of ensuring device reliability.
In this role, you will be responsible for enhancing the reliability of Google's services, focusing on both internal and external systems. You will work on optimizing existing systems and building infrastructure that eliminates manual work through automation. Your contributions will directly impact the performance and reliability of the Android Ecosystem, ensuring that users have a seamless experience. You will drive Android product decisions by implementing Critical User Journey (CUJ) monitoring, which is crucial for understanding user interactions and improving service quality.
You will collaborate with cross-functional teams to address complex challenges and drive improvements in system capacity and performance. Your role will involve designing and implementing new tooling and services that facilitate safe end-to-end incident response, ensuring that the team can react swiftly to any issues that arise. You will also be expected to mentor junior engineers, sharing your knowledge and expertise to foster a culture of learning and growth within the team.
At Google, you will be part of a dynamic team that values innovation and collaboration. We offer competitive compensation and benefits, including opportunities for professional development and growth. You will have the chance to work on meaningful projects that impact millions of users worldwide, contributing to the reliability and performance of critical systems. Our culture promotes self-direction and encourages you to take risks in a blame-free environment, allowing you to explore your ideas and make a significant impact on the organization.