LeethubLeethub
JobsCompaniesBlog
Go to dashboard

Leethub

Curated tech jobs from FAANG and top companies worldwide.

Top Companies

  • Google Jobs
  • Meta Jobs
  • Amazon Jobs
  • Apple Jobs
  • Netflix Jobs
  • All Companies →

Job Categories

  • Software Engineering
  • Data, AI & Machine Learning
  • Product Management
  • Design & User Experience
  • Operations & Strategy
  • Remote Jobs
  • All Categories →

Browse by Type

  • Remote Jobs
  • Hybrid Jobs
  • Senior Positions
  • Entry Level
  • All Jobs →

Resources

  • Google Interview Guide
  • Salary Guide 2025
  • Salary Negotiation
  • LeetCode Study Plan
  • All Articles →

Company

  • Dashboard
  • Privacy Policy
  • Contact Us
© 2026 Leethub LLC. All rights reserved.
Home›Jobs›Doctolib›Senior Data Engineer - AI Focused (x/f/m)
Doctolib

About Doctolib

Simplifying healthcare access for millions

👥 1K-5K📅 Founded 2013📍 Levallois-Perret, Île-de-France, France

Key Highlights

  • 17,000+ healthcare professionals using the platform
  • 6 million patients served monthly
  • Presence in 435 healthcare facilities across Europe
  • Headquartered in Levallois-Perret, France

Doctolib is the leading European platform for online medical appointment scheduling, serving over 17,000 healthcare professionals and connecting with 6 million patients monthly. Headquartered in Levallois-Perret, Île-de-France, Doctolib is present in 435 healthcare facilities across France and Germa...

🎁 Benefits

Employees enjoy competitive salaries, stock options, generous PTO, and a flexible remote work policy, promoting a healthy work-life balance....

🌟 Culture

Doctolib fosters a culture centered around improving healthcare access, emphasizing technology-driven solutions and a commitment to user experience. T...

🌐 Website💼 LinkedIn𝕏 TwitterAll 213 jobs →
Doctolib

Senior Data Engineer - AI Focused (x/f/m)

Doctolib • Paris, Paris, France

Posted 3d agoSeniorData engineer📍 Paris
Apply Now →

Skills & Technologies

GCPBigQueryDataflowPub/subCloud storageVertex aiNosqlVector databases

Overview

Doctolib is seeking a Senior Data Engineer focused on AI to build and optimize data foundations for AI models. You'll work with GCP and various data technologies to ensure high-quality data for healthcare applications.

Job Description

Who you are

You have 5+ years of experience as a Data Engineer, with a strong focus on building scalable data pipelines and ensuring data quality for AI applications. Your expertise in Google Cloud Platform (GCP) allows you to design and maintain data infrastructures that support machine learning and AI initiatives. You are familiar with both structured and unstructured data, and you understand how to integrate various data sources into unified models that can be utilized for AI consumption.

Your background includes working with NoSQL and Vector Databases, enabling you to efficiently store and retrieve embeddings and documents. You have a solid understanding of data governance and privacy, ensuring that the data you work with is compliant and reliable. You thrive in collaborative environments, working closely with machine learning and platform teams to define data schemas and partitioning strategies that enhance performance and scalability.

Desirable

Experience with large language models (LLMs) and multimodal models is a plus, as is familiarity with data quality and lineage frameworks. You are comfortable optimizing data pipelines for performance and cost, leveraging GCP native services to achieve the best results.

What you'll do

In your role at Doctolib, you will be responsible for building and optimizing the data foundations within the AI Team. This includes designing, building, and maintaining scalable data pipelines on GCP tailored for AI and machine learning use cases. You will implement data ingestion and transformation frameworks that power retrieval systems and training datasets for LLMs and multimodal models.

You will ensure high standards of data quality for AI model inputs, collaborating with engineers and data scientists to facilitate efficient training, evaluation, and deployment of AI models. Your work will involve architecting and managing NoSQL and Vector Databases to store and retrieve data effectively, ensuring that the data is well-structured and compliant.

You will also integrate various data sources, including text, speech, images, and documents, into unified data models that are ready for AI consumption. Your role will require you to optimize the performance and cost of data pipelines using GCP services such as BigQuery, Dataflow, Pub/Sub, Cloud Storage, and Vertex AI. You will contribute to data quality and lineage frameworks, ensuring that AI models are trained on validated and reliable data.

What we offer

At Doctolib, you will join a dedicated team on a mission to transform healthcare through AI. We offer a collaborative work environment where your contributions will have a direct impact on the healthcare industry. You will have the opportunity to work with cutting-edge technologies and be part of a team that values innovation and excellence. We encourage you to apply even if your experience doesn't match every requirement, as we believe in the potential of diverse backgrounds and perspectives.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Doctolib.

Apply Now →Get Job Alerts