
Empowering cashless transactions for millions in India
PhonePe, headquartered in Bengaluru, Karnataka, is a leading digital payments platform in India, serving over 400 million users. The company offers a wide range of services including money transfers, bill payments, and merchant transactions, processing over $1 trillion in annual payment volume. With...
PhonePe offers competitive salaries, equity options, generous parental leave, and a flexible remote work policy to support work-life balance....
PhonePe fosters a culture of innovation and agility, encouraging employees to experiment and implement new ideas in the rapidly evolving fintech lands...

PhonePe • Bangalore
PhonePe is hiring a Site Reliability Engineer to maintain the reliability and performance of its fintech infrastructure. You'll work with technologies like Java, AWS, and Docker to ensure seamless operations. This role requires experience in cloud environments and system reliability.
You have a strong background in site reliability engineering with at least 3-5 years of experience in maintaining large-scale systems. Your expertise in cloud environments, particularly AWS, allows you to design and implement robust solutions that enhance system reliability and performance. You are proficient in programming languages such as Java and Python, which you use to automate processes and improve system efficiency. Your experience with containerization technologies like Docker and orchestration tools such as Kubernetes enables you to manage microservices effectively. You are well-versed in Linux systems, which you leverage to troubleshoot and optimize server performance. You understand the importance of infrastructure as code and have hands-on experience with tools like Terraform to manage cloud resources efficiently.
Experience with monitoring and logging tools such as Prometheus or Grafana is a plus, as it helps you maintain system health and performance metrics. Familiarity with CI/CD pipelines and DevOps practices will enhance your ability to streamline deployment processes and improve collaboration with development teams.
In this role, you will be responsible for ensuring the reliability and performance of PhonePe's fintech infrastructure, which supports millions of transactions daily. You will collaborate with cross-functional teams to design and implement solutions that enhance system availability and scalability. Your day-to-day tasks will include monitoring system performance, troubleshooting issues, and implementing automation to reduce manual intervention. You will also participate in incident response and post-mortem analysis to identify root causes and prevent future occurrences. Additionally, you will contribute to the development of best practices for system reliability and performance optimization, ensuring that PhonePe continues to deliver exceptional service to its users.
At PhonePe, we foster a culture of innovation and collaboration, empowering you to take ownership of your work from day one. You will have the opportunity to work with some of the brightest minds in the industry, tackling complex challenges and building solutions that impact millions of users. We offer competitive compensation and benefits, along with a supportive environment that encourages professional growth and development. Join us in our mission to revolutionize digital payments in India and make a difference in the lives of our users.
Apply now or save it for later. Get alerts for similar jobs at PhonePe.