LeethubLeethub
JobsCompaniesBlog
Go to dashboard

Leethub

Curated tech jobs from FAANG and top companies worldwide.

Top Companies

  • Google Jobs
  • Meta Jobs
  • Amazon Jobs
  • Apple Jobs
  • Netflix Jobs
  • All Companies →

Job Categories

  • Software Engineering
  • Data, AI & Machine Learning
  • Product Management
  • Design & User Experience
  • Operations & Strategy
  • Remote Jobs
  • All Categories →

Browse by Type

  • Remote Jobs
  • Hybrid Jobs
  • Senior Positions
  • Entry Level
  • All Jobs →

Resources

  • Google Interview Guide
  • Salary Guide 2025
  • Salary Negotiation
  • LeetCode Study Plan
  • All Articles →

Company

  • Dashboard
  • Privacy Policy
  • Contact Us
© 2026 Leethub LLC. All rights reserved.
Home›Jobs›Nebius AI›Senior Software Engineer in Hardware Infrastructure Observability
Nebius AI

About Nebius AI

Empowering AI with robust infrastructure solutions

🏢 Tech👥 51-250📅 Founded 2022📍 Amsterdam, North Holland, Netherlands

Key Highlights

  • Publicly traded on Nasdaq, expanding AI infrastructure market
  • Headquartered in Amsterdam with hubs in the US, Europe, and Israel
  • Team of around 400 skilled engineers focused on AI/ML
  • Specializes in large-scale GPU clusters and cloud platforms

Nebius is a Nasdaq-listed company headquartered in Amsterdam, specializing in AI infrastructure solutions. With a team of around 400 engineers, Nebius provides large-scale GPU clusters and cloud platforms designed to support the rapid growth of the AI industry. The company has established R&D and co...

🎁 Benefits

Nebius offers competitive equity packages, a flexible PTO policy, and opportunities for remote work. Employees also benefit from a learning budget to ...

🌟 Culture

Nebius fosters a culture centered around engineering excellence and innovation in AI infrastructure. The company values collaboration across its globa...

🌐 Website💼 LinkedInAll 186 jobs →
Nebius AI

Senior Software Engineer in Hardware Infrastructure Observability

Nebius AI • Amsterdam, Netherlands

Posted 2w ago🏛️ On-SiteSeniorSoftware engineering📍 Amsterdam
Apply Now →

Skills & Technologies

PythonLinuxDockerKubernetesPrometheusGrafana

Overview

Nebius AI is seeking a Senior Software Engineer to join their Hardware Infrastructure Observability team. You'll design and develop services for monitoring server fleets and data center systems, utilizing skills in Python and Linux. This role is based in Amsterdam.

Job Description

Who you are

You have 5+ years of experience in software engineering, particularly in building and maintaining infrastructure observability systems. Your expertise in Python and Linux allows you to develop robust monitoring solutions that ensure the reliability of large-scale server fleets. You are familiar with containerization technologies such as Docker and orchestration tools like Kubernetes, which you have used to streamline deployment processes and enhance system performance. Your experience with monitoring tools like Prometheus and Grafana enables you to create insightful dashboards and alerts that help maintain system health. You thrive in collaborative environments, working closely with cross-functional teams to drive improvements and resolve incidents effectively. You are proactive in investigating issues and implementing root-cause fixes, ensuring that systems remain operational and efficient.

Desirable

Experience with cloud infrastructure and AI/ML systems is a plus, as is familiarity with incident response protocols and debugging techniques. You are comfortable working in a fast-paced environment and are eager to learn new technologies that can enhance your contributions to the team.

What you'll do

As a Senior Software Engineer at Nebius, you will be responsible for designing and developing services and agents that provide deep visibility into a large server fleet and data center engineering systems. You will evolve metrics, aggregation, and alerting pipelines to improve signal quality and ensure that the infrastructure remains healthy. Your role will involve building maintenance workflows and automation processes that facilitate safe and predictable fleet-wide changes. You will also investigate incidents hands-on, including on-host debugging, and drive root-cause fixes to enhance system reliability. Collaboration with other engineers and teams will be key as you work to improve the overall performance and efficiency of the infrastructure.

What we offer

Nebius offers a competitive salary and a comprehensive benefits package, along with opportunities for professional growth within the company. You will enjoy flexible working arrangements and be part of a dynamic and collaborative work environment that values initiative and innovation. As Nebius continues to grow and expand its products, you will have the chance to contribute to exciting projects that shape the future of AI cloud infrastructure.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Nebius AI.

Apply Now →Get Job Alerts