
Empowering the world through technology and information
Google LLC, headquartered in Mountain View, California, is a global leader in internet-related services and products, including its flagship search engine, Google Search, and the Android operating system. With over 100,000 employees, Google also offers cloud computing services through Google Cloud P...
Google offers competitive salaries, equity options, generous PTO policies, comprehensive health benefits, and a remote work policy that allows flexibi...
Google is known for its engineering-first culture, emphasizing innovation and collaboration. The company fosters a unique environment that encourages ...

Google • San Francisco, CA, USA
Google is seeking a Site Reliability Engineer III to ensure the reliability and uptime of its services. You'll work with Java and Python to design and troubleshoot large-scale distributed systems. This role requires a Bachelor's degree in Computer Science and 2 years of relevant experience.
You hold a Bachelor’s degree in Computer Science or a related field, and you have at least 2 years of experience in software development using one or more programming languages. Your background includes designing, analyzing, and troubleshooting large-scale distributed systems, which is essential for the Site Reliability Engineering (SRE) role. You are familiar with the principles of SRE, combining software and systems engineering to build and run large-scale, fault-tolerant systems. You possess a strong understanding of coding, algorithms, and complexity analysis, which allows you to tackle the unique challenges of scale that Google faces. You thrive in a culture of intellectual curiosity and problem-solving, and you are eager to collaborate with a diverse team to drive improvements in system reliability and performance.
A Master’s degree in Computer Science or Engineering is preferred, as well as additional experience in optimizing existing systems and building infrastructure through automation. You are comfortable working in a blame-free environment that promotes self-direction and meaningful project work. Your ability to adapt documentation based on user feedback and product updates is a valuable asset.
As a Site Reliability Engineer at Google, you will be responsible for ensuring that both internally critical and externally-visible systems maintain reliability and uptime that meets user needs. You will monitor system capacity and performance, proactively addressing potential issues before they impact users. Your role will involve triaging product or system issues, debugging, and tracking resolutions by analyzing the sources of issues and their impact on hardware, network, or service operations. You will participate in or lead design reviews with peers and stakeholders to evaluate available technologies and make informed decisions. Your contributions will help shape the future of Google's services, ensuring they remain robust and efficient.
At Google, you will have the opportunity to work on complex challenges unique to our scale, using your expertise to make a significant impact. We foster a collaborative environment where diverse perspectives are valued, and we encourage you to apply even if your experience doesn't match every requirement. You will receive support and mentorship to help you grow in your career while working on meaningful projects that drive innovation in technology and user experience.
Apply now or save it for later. Get alerts for similar jobs at Google.