
The cloud monitoring platform engineers love
Datadog (NYSE: DDOG) is a leading cloud observability platform that provides monitoring and analytics for applications, infrastructure, and logs. Trusted by over 26,000 customers including major companies like Netflix, Samsung, and Airbnb, Datadog is headquartered in New York City. The company went ...
Datadog offers competitive salaries, equity options, generous PTO policies, and a flexible remote work policy. Employees also benefit from a learning ...
Datadog fosters an engineering-first culture, with 70% of its workforce comprising engineers. The company emphasizes a strong focus on solving complex...

Datadog • New York, New York, USA
Datadog is hiring a Senior AI Engineer to lead the development of AI-powered features for Application Performance Monitoring. You'll work with technologies like Python and Machine Learning to enhance performance detection and resolution. This role requires strong expertise in AI and experience with Datadog's platform.
You have 5+ years of experience in AI engineering, focusing on building and deploying machine learning models that enhance product capabilities. Your background includes a deep understanding of application performance monitoring and the ability to analyze complex telemetry data to derive actionable insights. You are proficient in Python and have hands-on experience with AI frameworks and tools, including Datadog. You thrive in collaborative environments, working closely with product teams to shape user experiences and drive product innovation. You are comfortable with both the technical and product aspects of engineering, ensuring that solutions are not only effective but also user-friendly. You are passionate about leveraging AI to solve real-world problems and improve system reliability.
Experience with large language models (LLMs) and autonomous agents is a plus, as is familiarity with distributed tracing and service representation. You have a knack for prototyping and iterating on solutions quickly, and you understand the importance of defining success metrics and conducting experiments to validate your ideas.
In this role, you will lead the end-to-end development of AI features for Datadog's APM Experiences team. You will design and implement LLM and agent-based workflows that analyze application performance data, providing insights and recommendations to users. Your responsibilities will include building robust agent systems that can autonomously diagnose issues and suggest optimizations. You will collaborate with cross-functional teams to integrate these features into Datadog's platform, ensuring they deliver real value to users. You will prototype quickly, define success metrics, and iterate based on user feedback to refine the product. Your work will directly impact how customers interact with application performance monitoring, helping them resolve issues faster and more effectively.
Datadog offers a dynamic work environment where innovation is encouraged. You will have the opportunity to work with cutting-edge technologies and contribute to a product that is essential for many businesses. We provide competitive compensation and benefits, along with opportunities for professional growth and development. Join us in shaping the future of application performance monitoring with AI-driven solutions.
Apply now or save it for later. Get alerts for similar jobs at Datadog.