
Empowering corporate mentorship for effective learning
Together is a corporate mentorship management platform founded in 2018, headquartered in CityPlace, Toronto, ON. The platform streamlines the mentorship lifecycle, facilitating connections among employees at companies like Heineken, Reddit, and 7-Eleven. With $1.7 million in seed funding, Together a...
Together offers competitive salaries and equity packages, 4 weeks of paid vacation, and a comprehensive health, dental, and vision plan through Honeyb...
Together fosters a culture of autonomy and impact, allowing employees to take on significant responsibilities without bureaucratic constraints. The fo...

Together AI • San Francisco
Together AI is hiring a Systems Research Engineer specialized in GPU Programming to develop and optimize GPU-accelerated kernels for ML/AI applications. You'll collaborate with cross-functional teams and leverage your expertise in GPU programming and parallel computing. This role requires a strong background in GPU programming techniques.
You have a strong background in GPU programming and parallel computing, with expertise in technologies such as CUDA and/or Triton. Your knowledge of ML/AI applications and models allows you to contribute effectively to the development of GPU-accelerated solutions. You possess excellent problem-solving and analytical skills, enabling you to optimize and fine-tune GPU code for better performance and scalability. With a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experience, you are well-equipped to tackle complex challenges in this field.
Staying up-to-date with the latest advancements in GPU programming techniques is important to you, and you are eager to apply this knowledge to enhance the performance and efficiency of AI systems. Your collaborative spirit allows you to work effectively with cross-functional teams, integrating GPU-accelerated solutions into existing software systems.
As a Systems Research Engineer at Together AI, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. You will work closely with the modeling and algorithm team to co-design GPU kernels and model architecture, ensuring that our AI infrastructure remains at the forefront of innovation. Your research skills will be vital in exploring new GPU programming techniques and contributing to the co-design of efficient GPU architectures and programming models.
You will optimize and fine-tune GPU code to achieve better performance and scalability, collaborating with hardware and software teams to integrate GPU-accelerated solutions into existing systems. Your contributions will help enhance the overall efficiency of our AI systems, making a significant impact on our research-driven initiatives.
Together AI is committed to fostering an inclusive and innovative work environment. You will have the opportunity to work on cutting-edge technologies and contribute to the advancement of AI systems. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds. Join us in our mission to drive open and transparent AI systems that will shape the future of technology.
Apply now or save it for later. Get alerts for similar jobs at Together AI.