
Empowering product teams with open-source analytics
PostHog is a San Francisco-based company that provides an open-source product analytics platform, enabling businesses to optimize their product development. With over 10,000 customers, including notable brands like Microsoft and Intercom, PostHog offers tools such as heatmaps, session recording, fea...
PostHog offers competitive salaries, equity options, flexible remote work policies, unlimited PTO, and a learning budget to support employee developme...
PostHog fosters a transparent culture by making its entire handbook public, emphasizing product-led growth and encouraging team members to take owners...

PostHog • Remote
PostHog is hiring a Software Engineer for their Warehouse Pipeline team to build and iterate on their data import system. You'll work with Python, Apache Arrow, and Kubernetes to manage data workflows. This position requires experience in building resilient systems and debugging complex data processes.
You have a strong background in software engineering, particularly with Python, and are experienced in building data import systems. You understand the intricacies of managing data workflows and have a knack for debugging stateful systems. Your familiarity with Kubernetes allows you to navigate pod metrics effectively, ensuring smooth operations. You are creative in finding solutions to make libraries manageable at scale and are eager to revamp schema management strategies. You thrive in collaborative environments and are committed to building resilient systems that include logging, observability, and testing. You are open to learning and adapting to new technologies as needed, and you value a fair and accessible work culture.
As a Software Engineer on the Warehouse Pipeline team, you will be responsible for building and iterating on PostHog's data import system. This involves developing import workers in Python that pull data from various APIs and databases in batches. You will process this data using Apache Arrow in memory and ensure it is moved into object storage in open table formats. Your role will also include maintaining the source library and implementing creative solutions for scalability. You will revamp the schema management strategy and build resilient systems that incorporate logging, observability, and testing. Debugging stateful data workflows will be a key part of your responsibilities, requiring you to delve into Kubernetes pod metrics. Additionally, you will schedule jobs using Temporal.io, tackling a wide range of challenges and opportunities within the data stack group.
PostHog provides a supportive environment where you can grow your skills and contribute to meaningful projects. You will be part of a team that values open-source principles and product-led development. The company is well-funded and committed to ensuring a fair and accessible interview process. You will have the opportunity to work remotely, allowing for flexibility in your work-life balance. PostHog encourages you to apply even if your experience doesn't match every requirement, as they value diverse perspectives and backgrounds.
Apply now or save it for later. Get alerts for similar jobs at PostHog.