✨ AI Summary
Apple is seeking a Technical Operations & Site Reliability Engineer for their Customer Systems team. You'll be responsible for maintaining the reliability and performance of globally distributed systems while developing automation solutions using Java, Python, and Go. This role requires strong software engineering skills and offers an opportunity to work in a fast-paced environment in Sunnyvale.
Job Description
At Apple, Customer Experience is at the forefront of everything we do. The Customer Systems Operations team is looking for a highly skilled and motivated TechOps Engineer (Technical Operations & Site Reliability) to join us. The team is responsible for maintaining the reliability, availability, and performance of business-critical, globally distributed systems.
If you have the desire and motivation to design and develop automation solutions to streamline system sustenance, monitoring, and operational workflows, while collaborating closely with support, engineering and business operations teams, this profile is for you. Ideal candidates will combine a passion for operational excellence with strong software engineering skills, and thrive in a fast-paced, change-driven environment focused on continuous improvement and flawless delivery.
Manage large-scale production outages, leading incident response and improving efficiency.
Design, build, and maintain automation solutions to streamline the monitoring, sustenance, and management of large-scale distributed systems.
Develop tools and software (using Java/JEE, REST, Swift/Objective C, Python, Go, or Bash) to automate repetitive operational tasks, reduce manual intervention, and improve system reliability. Utilize AI & LLM models to achieve Operational Excellence in application support.
Plan and execute actionable system health monitoring, incident response, and communication across critical global applications. Drive operational metrics and KPI identification and alignment.
Partner with multi-functional teams to improve reliability, efficiency, stability, and processes.
Be a self-directed problem-solver exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner.
Create and maintain accurate, up-to-date documentation reflecting architecture, infra configuration, and procedures. Write status and incident reports. Write training material and train users in complex topics.
Partner with a team of highly skilled engineers across the globe and guide their work towards operational excellence, gaining efficiency.
Build a culture where the regional members are responsible for cultivating strong in-region relationships and getting results for our business partners ensuring they remain informed about significant incidents and problems.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Apple.