
Empowering the world through technology and information
Google LLC, headquartered in Mountain View, California, is a global leader in internet-related services and products, including its flagship search engine, Google Search, and the Android operating system. With over 100,000 employees, Google also offers cloud computing services through Google Cloud P...
Google offers competitive salaries, equity options, generous PTO policies, comprehensive health benefits, and a remote work policy that allows flexibi...
Google is known for its engineering-first culture, emphasizing innovation and collaboration. The company fosters a unique environment that encourages ...

Google • Mountain View, CA, USA, Los Angeles, CA, USA, Seattle, WA, USA, San Francisco, CA, USA
Google is seeking a Senior Research Software Engineer specializing in ML Network Optimization. You'll work with C++, Python, and various machine learning optimization techniques to enhance performance in complex systems. This role requires 5+ years of experience in software development and machine learning.
You have a Bachelor’s degree in Computer Science or a related technical field, along with 5 years of experience in software development using languages such as C++ and Python. Your background includes 3 years of experience in testing, maintaining, or launching software products, and you possess a solid understanding of software design and architecture. You have specialized knowledge in Machine Learning Optimization, including techniques like quantization and distillation, and are familiar with GPU/TPU hardware architecture and kernel programming. Your experience extends to ML infrastructure, where you have worked on model deployment, evaluation, optimization, and debugging. A Master's degree or PhD in a relevant field is preferred, along with experience in data structures, algorithms, and hardware-software co-design. You have also held a technical leadership role, driving architectural transitions and optimizing real-time media pipelines.
Experience with kernel programming technologies such as CUDA, OpenCL, and Vulkan is a plus, as is familiarity with compiler technologies like XLA. You have a passion for developing accessible technologies and are eager to collaborate with hardware and compiler teams to influence next-generation architecture and software stack requirements.
In this role, you will analyze and profile complex ML models and graphics pipelines to identify and resolve performance bottlenecks. You will collaborate closely with hardware and compiler teams to facilitate long-term gains in the performance of our ML software and hardware stack. Your work will involve navigating hardware-software co-design, ensuring that our products can handle information at massive scale. You will contribute to the development of next-generation technologies that change how billions of users connect and interact with information. Your responsibilities will include testing and maintaining software products, as well as launching new features that enhance user experience. You will also be involved in mentoring junior engineers and sharing your expertise in machine learning optimization techniques.
At Google, we provide an environment that fosters innovation and collaboration. You will have the opportunity to work on cutting-edge technologies that impact millions of users worldwide. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds. Our team is committed to creating accessible technologies and improving the performance of our systems. We offer competitive compensation and benefits, along with opportunities for professional growth and development within the company.
Apply now or save it for later. Get alerts for similar jobs at Google.