
Empowering IT professionals with powerful management tools
SolarWinds Inc. is a leading provider of IT management software, headquartered in Austin, Texas. The company offers a range of products including network performance monitoring, systems management, and IT security solutions, serving over 300,000 customers worldwide, including major organizations lik...
Employees enjoy competitive salaries, stock options, generous PTO policies, remote work flexibility, and comprehensive health benefits....
SolarWinds fosters a culture focused on customer success and product excellence, with a strong emphasis on engineering and innovation in IT management...

SolarWinds • Bangalore, India
SolarWinds is seeking a Senior Site Reliability Engineer to enhance its infrastructure and site reliability practices. You'll work with AWS, GCP, Kubernetes, and GitOps to ensure high-quality service delivery. This role requires experience implementing SRE practices and collaborating with cross-functional teams.
You have a strong background in site reliability engineering, with at least 5 years of experience in the field. Your expertise includes working with cloud platforms such as AWS and GCP, and you are well-versed in container orchestration using Kubernetes. You understand the importance of GitOps and have implemented it in previous roles to streamline deployment processes. You thrive in collaborative environments and enjoy working closely with software engineering teams to define and enhance infrastructure. Your approach to SRE practices is proactive, focusing on SLAs, SLOs, and incident management to ensure system reliability and performance. You are committed to continuous improvement and always look for ways to optimize processes and enhance service delivery.
Experience with monitoring tools and incident response frameworks is a plus. Familiarity with infrastructure as code (IaC) practices and tools such as Terraform or CloudFormation will set you apart. You are also encouraged to bring any additional skills in automation and scripting languages that can contribute to the efficiency of the SRE team.
In this role, you will be responsible for developing and operating the infrastructure that supports both development and production environments. You will collaborate with cross-functional engineering teams to define infrastructure requirements and ensure that systems are designed for reliability and scalability. Your day-to-day tasks will include implementing monitoring and alerting systems to proactively manage incidents and ensure that SLAs and SLOs are met. You will also conduct postmortems and reviews to learn from incidents and improve processes. As a senior member of the team, you will mentor junior engineers and help foster a culture of accountability and continuous learning within the SRE team.
At SolarWinds, we value our employees and offer a supportive work environment that encourages growth and development. You will have the opportunity to work on innovative projects that have a real impact on our customers' success. We provide competitive compensation and benefits, along with opportunities for professional development and career advancement. Join us in our mission to deliver powerful and secure solutions that help our customers accelerate their business transformation.