LeethubLeethub
JobsCompaniesBlog
Go to dashboard

Leethub

Curated tech jobs from FAANG and top companies worldwide.

Top Companies

  • Google Jobs
  • Meta Jobs
  • Amazon Jobs
  • Apple Jobs
  • Netflix Jobs
  • All Companies →

Job Categories

  • Software Engineering
  • Data, AI & Machine Learning
  • Product Management
  • Design & User Experience
  • Operations & Strategy
  • Remote Jobs
  • All Categories →

Browse by Type

  • Remote Jobs
  • Hybrid Jobs
  • Senior Positions
  • Entry Level
  • All Jobs →

Resources

  • Google Interview Guide
  • Salary Guide 2025
  • Salary Negotiation
  • LeetCode Study Plan
  • All Articles →

Company

  • Dashboard
  • Privacy Policy
  • Contact Us
© 2026 Leethub LLC. All rights reserved.
Home›Jobs›Amazon›Software Development Engineer, ML Infrastructure Team
Amazon

About Amazon

The everything store and cloud computing leader

🏢 Tech👥 1001+ employees📅 Founded 1995📍 South Lake Union, Seattle, WA⭐ 3.7
B2CB2BMarketplaceCloud ComputingeCommerce

Key Highlights

  • Headquartered in South Lake Union, Seattle, WA
  • Over 1.5 million employees worldwide
  • Leading cloud services through Amazon Web Services (AWS)
  • Acquired Whole Foods, Twitch, and Ring

Amazon, headquartered in South Lake Union, Seattle, WA, is the world's largest online retailer and a leader in cloud computing through Amazon Web Services (AWS). With over 1.5 million employees globally, Amazon operates in various sectors, including AI with its Alexa devices and a vast marketplace k...

🎁 Benefits

Amazon offers competitive salaries, stock options, generous PTO policies, and comprehensive health benefits. Employees also have access to a learning ...

🌟 Culture

Amazon's culture is driven by customer obsession and a focus on innovation. The company encourages employees to think big and move fast, fostering an ...

🌐 Website💼 LinkedIn𝕏 TwitterAll 94518 jobs →
Amazon

Software Development Engineer, ML Infrastructure Team

Amazon • Seattle, Washington, USA

Posted 3 months ago🏛️ On-SiteMid-LevelSoftware development engineer📍 Seattle
Apply Now →

Job Description

Want to help drive the success of Machine Learning technologies at AWS? Do you have the skills and motivation to build automation that supports the success of peer teams? We want to talk to you!

We seek a Software Development Engineer for the Machine Learning (ML) Infrastructure team to build the tools that are used to guarantee top performance of AWS ML and High Performance Computing (HPC) technologies developed by our organization. Bring your exceptional knowledge of CI/CD automation, ML and HPC benchmarks and applications to bear on the cutting-edge software we develop. Join us as we expand the AWS offerings for AI, including Trainium, Neuron and the Elastic Fabric Adapter (EFA).

Key job responsibilities
Be an autonomous engineer on a team that builds and maintains the infrastructure that monitors and reports on functionality and performance of massive testing workloads run at scale. Use internal Amazon CI/CD tools, Linux, and public AWS products to automate the delivery of our software to customers, saving developer time. Write Python code that effortlessly spools up large clusters and runs benchmarks and applications for ML and HPC workloads. Use AWS Managed Grafana and Athena to digest the massive amount of performance data generated by these workloads and create dashboards for developers and stakeholders. Invent automatic mechanisms to alert developers to functional and performance regressions so they never reach reach customers. Manage the complexity of infrastructure that covers many instance types, software stacks, Linux operating systems, cutting-edge releases and make it easy to evolve.

A day in the life
You use Typescript and the CDK to ensure all infrastructure setup is code (IoC), reviewed and committed to automated pipelines. You find innovative ways to schedule work using SLURM and Active Directory, supporting multiple teams of developers while keeping cluster costs down. You write crisp designs for your projects, communicating clearly to your peers what you will build.

About the team
We are part of Annapurna Labs, a subsidiary in AWS that builds software and hardware that make ML on EC2 work. Our organization is a dedicated group of innovators that have invented new networks, new silicon, new software suites, and combined those to entice customers to move immense ML and HPC workloads to the cloud. The ML Infrastructure team is laser focused on making AWS the best and most cost-effective place for customers to do AI at scale. - 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience with CI/CD pipelines build processes
- Experience using Linux, demonstrating proficiency with associated tools or languages
- Experience coding in Python, Typescript, CDK- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Amazon.

Apply Now →Get Job Alerts