
Empowering engineers with PCB design software
Altium Limited, headquartered in San Diego, California, specializes in PCB design software, empowering engineers with tools like Altium Designer and Altium 365. Altium has over 50,000 users globally, went public in 1999, and has consistently focused on enhancing the design process for electronics. T...
Altium offers competitive salaries, stock options, a generous PTO policy, remote work flexibility, and a learning budget for professional development....
Altium fosters a culture centered on innovation and user-centric design, encouraging employees to contribute ideas that enhance product functionality ...

Altium • Lisbon, Portugal Office
Altium is seeking a Site Reliability Engineer to ensure the reliability and performance of its cloud platforms. You'll work with technologies like AWS, Docker, and Kubernetes to automate operational tasks and improve observability. This role requires a blend of software engineering and systems administration skills.
You have a strong background in site reliability engineering, with experience in ensuring the reliability, availability, and performance of large-scale software systems. You understand the importance of automating operational tasks and have a knack for improving observability through logging and monitoring. Your experience with incident management allows you to effectively collaborate with development and technology teams to build more reliable and scalable applications. You are familiar with cloud platforms and have a solid understanding of how they operate, which enables you to pioneer improvements in system reliability. You are comfortable working in a fast-paced environment and are eager to contribute to the success of the Altium Cloud Platforms.
Experience with specific reliability frameworks and patterns that standardize resilience across SaaS products is a plus. Familiarity with tools like Prometheus for monitoring and alerting will enhance your ability to proactively detect issues. You may also have experience with containerization technologies such as Docker and orchestration tools like Kubernetes, which are essential for managing cloud infrastructure effectively.
As a Site Reliability Engineer at Altium, you will be responsible for ensuring the reliability and performance of our cloud platforms. You will automate operational tasks to improve efficiency and reduce manual intervention. Your role will involve developing and implementing reliability frameworks that elevate the resilience of our SaaS products across multiple regions and environments. You will work closely with development teams to enhance observability, focusing on logging, monitoring, and application performance management (APM). Your contributions will help in proactive issue detection and resolution, ensuring that our systems remain robust and performant.
You will also engage in incident management, collaborating with cross-functional teams to address and resolve issues swiftly. Your insights will drive improvements in system architecture and operational processes, contributing to the overall success of our technology initiatives. You will have the opportunity to influence the reliability practices within the organization and help shape the future of our cloud offerings.
At Altium, we provide a supportive work environment where innovation thrives. You will be part of a team that values collaboration and continuous improvement. We offer competitive compensation and benefits, along with opportunities for professional growth and development. Our office in Lisbon is designed to foster creativity and teamwork, and we encourage you to bring your unique perspective to the table. Join us in transforming the way electronics are designed and built, and be part of a company that is committed to making a significant impact in the EDA industry.