
Simplifying healthcare access for millions
Doctolib is the leading European platform for online medical appointment scheduling, serving over 17,000 healthcare professionals and connecting with 6 million patients monthly. Headquartered in Levallois-Perret, Île-de-France, Doctolib is present in 435 healthcare facilities across France and Germa...
Employees enjoy competitive salaries, stock options, generous PTO, and a flexible remote work policy, promoting a healthy work-life balance....
Doctolib fosters a culture centered around improving healthcare access, emphasizing technology-driven solutions and a commitment to user experience. T...

Doctolib • Paris, Paris, France
Doctolib is seeking a Senior/Staff Machine Learning Engineer to design and implement evaluation frameworks for AI systems in healthcare. You'll work with Python, TensorFlow, and PyTorch to ensure model quality and safety. This role requires strong experience in machine learning and data analysis.
You have 5+ years of experience in machine learning engineering, with a strong focus on building and evaluating AI systems. Your expertise in Python and frameworks like TensorFlow and PyTorch allows you to develop robust models that can handle complex healthcare data. You understand the importance of model evaluation and have experience in defining metrics and protocols that ensure AI systems behave reliably and safely.
You are comfortable collaborating with cross-functional teams, including product engineers and medical experts, to drive improvements in AI systems. Your analytical skills enable you to run systematic experiments that assess reasoning, factuality, and user experience, ensuring that the AI solutions you develop meet the highest standards of quality.
You are passionate about healthcare and the transformative potential of AI in this field. You stay updated with the latest methodologies in LLM evaluation and are eager to contribute to internal knowledge sharing within your team. You thrive in a collaborative environment and enjoy mentoring junior engineers, sharing your insights and best practices.
Experience with cloud computing platforms and distributed architectures is a plus. Familiarity with healthcare data and regulations will help you navigate the complexities of this domain effectively.
In your role at Doctolib, you will define and own the evaluation strategy for our AI systems, focusing on metrics, protocols, datasets, and tooling. You will implement and maintain automated evaluation pipelines that monitor model quality and safety across iterations. Your work will involve running systematic experiments to assess various aspects of the AI systems, including reasoning and user experience.
You will collaborate closely with model developers and research scientists, providing insights that drive iterative improvements in our AI solutions. Your contributions will be crucial in ensuring that our AI Health Companion behaves reliably and helpfully for millions of patients and practitioners.
You will also contribute to research on LLM evaluation methodologies, sharing your findings with the team to enhance our internal practices. Your role will require you to stay engaged with the latest advancements in AI and machine learning, ensuring that Doctolib remains at the forefront of healthcare technology.
At Doctolib, we are committed to transforming healthcare through innovative technology. You will be part of a dynamic team that values collaboration and knowledge sharing. We offer a competitive salary and benefits package, along with opportunities for professional growth and development. Join us in our mission to improve healthcare delivery and make a meaningful impact on the lives of patients and practitioners.
Apply now or save it for later. Get alerts for similar jobs at Doctolib.