System Engineer (Token Factory)

Nebius AI • Amsterdam, Netherlands; Germany; Israel; Prague, Czech Republic; Remote - Europe; Remote - United States; United Kingdom

Posted 3h ago🏠 Remote Systems engineer 📍 Amsterdam 📍 Germany 📍 Israel 📍 Prague 📍 United kingdom 📍 United states

Apply Now →

Skills & Technologies

C++Gpu programming

Overview

Nebius AI is seeking a Systems Engineer for their Token Factory team to develop and optimize AI inference platforms. You'll work with C++ and GPU programming to enhance performance across various hardware architectures. This role requires strong technical expertise in low-level systems.

Job Description

Who you are

You have a strong proficiency in C++ and expertise in GPU programming, particularly focusing on low-level systems. Your experience includes developing and optimizing low-level kernels and runtime components for AI inference, which is crucial for enhancing performance in inference engines. You are skilled at profiling and debugging system-level and hardware-level performance issues, ensuring that the systems you work on run efficiently and effectively. You have a collaborative mindset, working closely with machine learning and backend teams to optimize end-to-end execution of AI applications. Your understanding of new hardware architectures, such as Hopper, Blackwell, and Rubin, allows you to integrate support for cutting-edge technologies seamlessly.

Desirable

Experience with AI and machine learning frameworks is a plus, as it complements your technical skills. Familiarity with cloud computing environments and large-scale GPU platforms will enhance your ability to contribute to the team. You are eager to learn and adapt to new technologies, which is essential in the fast-evolving field of AI.

What you'll do

As a Systems Engineer at Nebius AI, you will be responsible for developing and optimizing the inference platform that supports various foundation models, including text, vision, audio, and emerging multimodal architectures. Your role will involve improving the performance of inference engines on GPU platforms, ensuring that they are fast, reliable, and effortless to deploy at scale. You will profile and debug system-level and hardware-level performance issues, identifying bottlenecks and implementing solutions to enhance overall system efficiency. Collaborating with machine learning and backend teams, you will optimize the end-to-end execution of AI applications, ensuring that they meet the high standards expected in the industry.

What we offer

Nebius AI provides a competitive salary and a comprehensive benefits package, along with opportunities for professional growth within the company. You will enjoy flexible working arrangements, allowing you to balance your personal and professional life effectively. The work environment is dynamic and collaborative, valuing initiative and innovation. As part of a rapidly growing team, you will have the chance to contribute to exciting projects that are shaping the future of AI and cloud computing. If you are excited about the challenges and opportunities in this field, we encourage you to apply and join us in leading the next era of AI cloud infrastructure.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Nebius AI.

Apply Now →Get Job Alerts

About Nebius AI

Key Highlights

🎁 Benefits

🌟 Culture