
Empowering every person and organization on the planet
Microsoft Corporation, headquartered in Redmond, Washington, is a leading technology company known for its software products like Windows and Office, as well as cloud services through Azure. With over 100,000 employees, Microsoft serves millions of customers globally, including major enterprises lik...
Microsoft offers competitive salaries, stock options, generous PTO policies, and comprehensive health benefits. Employees also enjoy a flexible remote...
Microsoft fosters a culture of innovation and inclusivity, emphasizing collaboration across teams and a commitment to diversity. The company values em...

Microsoft • United States, Multiple Locations, Multiple Locations
Microsoft is hiring a Software Engineer II to contribute to the Azure Compute team, focusing on building fault-tolerant distributed systems. You'll work with Azure technologies and apply AI and machine learning to enhance platform resilience.
You have a solid background in software engineering, with experience in designing and operating distributed systems. You understand the importance of fault tolerance and have a keen interest in cloud technologies, particularly Azure. Your collaborative spirit allows you to work effectively with data scientists and researchers, driving improvements in self-healing capabilities across platforms. You are data-informed in your decision-making and are comfortable with automated workflows that enhance system reliability. Your growth mindset aligns with Microsoft’s mission to empower every person and organization.
Experience with predictive failure models and automated root cause analysis is a plus. Familiarity with generative AI applications in diagnostics will set you apart. You thrive in environments that encourage innovation and are eager to contribute to long-term investments in technology and people.
As a Software Engineer II, you will play a crucial role in the Azure Compute team, focusing on delivering a highly elastic and consistently available infrastructure for cloud applications. You will contribute to the design and implementation of systems that meet stringent service-level agreements. Your work will involve developing automated repair workflows and monitoring services that ensure the health of millions of Azure machines. You will collaborate closely with cross-functional teams to enhance platform resilience through AI and machine learning initiatives. Your contributions will directly impact the customer experience by reducing downtime and improving service reliability. You will also engage in iterative development processes, responding rapidly to customer feedback to refine and enhance the systems you help build.
At Microsoft, you will be part of a culture that values innovation and collaboration. You will have opportunities for professional growth and development, working alongside talented individuals who share a commitment to excellence. The company offers a supportive environment where you can thrive and make a meaningful impact on the technology landscape. You will be encouraged to explore new ideas and technologies, contributing to projects that empower users and organizations worldwide.
Apply now or save it for later. Get alerts for similar jobs at Microsoft.