
Transforming healthcare with AI-driven cost reduction
Machinify, founded in 2016 and headquartered in Palo Alto, CA, is an AI and data-driven platform focused on reducing healthcare costs. The company has raised $15.6 million in funding and serves market leaders in the healthcare sector, helping Payers implement AI for treatment authorization and medic...
Machinify offers flexible remote work options across the US, a home office stipend, and premium medical, dental, and vision benefits. Employees enjoy ...
Machinify fosters a culture centered on leveraging AI to solve real-world healthcare challenges. The company emphasizes operational efficiency and dat...

Machinify • Remote/Palo Alto, CA
Machinify is hiring a Senior Data Engineer to transform raw external data into actionable datasets that drive operational decisions. You'll work with Python, Spark SQL, and Airflow to build and refine production pipelines. This role requires experience in data engineering and a strong understanding of healthcare data.
You have 5+ years of experience in data engineering, with a strong focus on building and maintaining production-grade data pipelines. Your expertise in Python and Apache Spark allows you to efficiently process high-volume datasets, ensuring data accuracy and reliability. You understand the importance of data observability and have experience implementing monitoring solutions to track data quality.
You are comfortable working in a collaborative environment, engaging with product managers, data scientists, and engineers to understand data requirements and deliver solutions that meet business needs. Your ability to communicate complex technical concepts to non-technical stakeholders makes you an effective team player.
You have a solid understanding of healthcare data standards and practices, which enables you to canonicalize raw healthcare data effectively. Your experience with data integration and onboarding new customers is a key asset in this role.
Experience with machine learning models and how data pipelines support their training and deployment is a plus. Familiarity with data visualization tools and techniques will help you present insights derived from the data you manage.
In this role, you will design and implement robust, production-grade data pipelines using Python, Spark SQL, and Airflow. Your primary responsibility will be to transform raw external data into trusted datasets that drive payment, product, and operational decisions. You will lead efforts to canonicalize healthcare data, ensuring it meets the necessary standards for integration into internal models.
You will collaborate closely with product managers and data scientists to understand their data needs and build scalable solutions that support their objectives. Your work will directly impact the company's machine learning models and core product experiences, making your contributions vital to the organization's success.
You will also play a critical role in onboarding new customers, integrating their raw data into our systems, and ensuring that the data is accurate and actionable. Your ability to own end-to-end workflows will be essential as you shape data standards and drive impact in a fast-moving environment.
At Machinify, we offer a dynamic work environment where innovation is encouraged, and your contributions are valued. You will have the opportunity to work with a talented team dedicated to transforming healthcare through data intelligence. We provide competitive compensation and benefits, along with opportunities for professional growth and development. Join us in our mission to maximize financial outcomes and drive down healthcare costs through data-driven solutions.
Apply now or save it for later. Get alerts for similar jobs at Machinify.