
Empowering AI with robust infrastructure solutions
Nebius is a Nasdaq-listed company headquartered in Amsterdam, specializing in AI infrastructure solutions. With a team of around 400 engineers, Nebius provides large-scale GPU clusters and cloud platforms designed to support the rapid growth of the AI industry. The company has established R&D and co...
Nebius offers competitive equity packages, a flexible PTO policy, and opportunities for remote work. Employees also benefit from a learning budget to ...
Nebius fosters a culture centered around engineering excellence and innovation in AI infrastructure. The company values collaboration across its globa...

Nebius AI • Amsterdam, Netherlands; Germany; Israel; Prague, Czech Republic; Remote - Europe; Remote - United States; United Kingdom
Nebius AI is seeking a Systems Engineer for their Token Factory team to develop and optimize AI inference platforms. You'll work with C++ and GPU programming to enhance performance across various hardware architectures. This role requires strong technical expertise in low-level systems.
You have a strong proficiency in C++ and expertise in GPU programming, particularly focusing on low-level systems. Your experience includes developing and optimizing low-level kernels and runtime components for AI inference, which is crucial for enhancing performance in inference engines. You are skilled at profiling and debugging system-level and hardware-level performance issues, ensuring that the systems you work on run efficiently and effectively. You have a collaborative mindset, working closely with machine learning and backend teams to optimize end-to-end execution of AI applications. Your understanding of new hardware architectures, such as Hopper, Blackwell, and Rubin, allows you to integrate support for cutting-edge technologies seamlessly.
Experience with AI and machine learning frameworks is a plus, as it complements your technical skills. Familiarity with cloud computing environments and large-scale GPU platforms will enhance your ability to contribute to the team. You are eager to learn and adapt to new technologies, which is essential in the fast-evolving field of AI.
As a Systems Engineer at Nebius AI, you will be responsible for developing and optimizing the inference platform that supports various foundation models, including text, vision, audio, and emerging multimodal architectures. Your role will involve improving the performance of inference engines on GPU platforms, ensuring that they are fast, reliable, and effortless to deploy at scale. You will profile and debug system-level and hardware-level performance issues, identifying bottlenecks and implementing solutions to enhance overall system efficiency. Collaborating with machine learning and backend teams, you will optimize the end-to-end execution of AI applications, ensuring that they meet the high standards expected in the industry.
Nebius AI provides a competitive salary and a comprehensive benefits package, along with opportunities for professional growth within the company. You will enjoy flexible working arrangements, allowing you to balance your personal and professional life effectively. The work environment is dynamic and collaborative, valuing initiative and innovation. As part of a rapidly growing team, you will have the chance to contribute to exciting projects that are shaping the future of AI and cloud computing. If you are excited about the challenges and opportunities in this field, we encourage you to apply and join us in leading the next era of AI cloud infrastructure.
Apply now or save it for later. Get alerts for similar jobs at Nebius AI.