Apple

About Apple

The personal technology company redefining user experience

🏢 Tech, Hardware • 👥 1001+ employees • 📅 Founded 1976 • 📍 Cupertino, CA • ⭐ 4.2
B2C • B2B • Hardware • SaaS • Telecommunications • eCommerce

Key Highlights

  • Market cap of $3 trillion as of 2022
  • Over 1 billion active devices worldwide
  • Comprehensive medical plans including mental healthcare
  • Paid parental leave and gradual return-to-work program

Apple Inc. (NASDAQ: AAPL), headquartered in Cupertino, CA, is the world's most valuable company, with a market capitalization of $3 trillion as of 2022. Known for iconic products such as the iPhone, iPad, and Mac, Apple has over 1 billion active devices in use globally. The company has a strong commi...

🎁 Benefits

Apple offers comprehensive medical plans covering physical and mental healthcare, paid parental leave, and a gradual return-to-work program. Employees...

🌟 Culture

Apple's culture emphasizes an obsessive focus on user experience and consumer privacy, setting it apart from competitors. The company promotes inclusi...

AIML - Machine Learning Engineer, Foundation Model Services

Apple • Seattle, Washington, United States

Posted 6 months ago • 🏛️ On-Site • Mid-Level • Machine learning engineer • 📍 Seattle

Job Description

Do you think differently? Are you eager to break the status quo, bold and ambitious, unafraid to take risks, and passionate about building best-in-class technology? If so, what better place to do it than Apple? At Apple, we think different and push the boundaries of computing and intelligence. We build products that bring a smile to people's faces.

The Foundation Model Infrastructure team, within the Machine Learning Platform Technologies organization, is the backbone of Apple Intelligence. It builds the frameworks, services, and tools that power the largest Apple foundation models on servers. Our infrastructure powers a wide range of services at Apple, including Apple Search, Apple Music, Apple TV, App Store, iMessage, Photos & Camera, Spotlight, Safari, Siri, and exciting upcoming Apple products, serving millions of queries every day with incredibly low latency while drawing every ounce of compute from our hardware.

As part of this group, you will have the chance to bring intelligence to billions of users across the world and make a difference in people's lives. You will work on optimizing language, vision, and speech models with billions of parameters using state-of-the-art technologies, and on running them at Apple scale.

Description

  • Work alongside the Foundation Model Research team to optimize inference for cutting-edge model architectures.
  • Work closely with product teams to build production-grade solutions that launch models serving millions of customers in real time.
  • Build tools to understand inference bottlenecks across different hardware and use cases.
  • Mentor and guide engineers in the organization.

Minimum Qualifications

  • Demonstrated experience leading and driving complex, ambiguous projects.
  • Experience with high-throughput services, particularly at supercomputing scale.
  • Proficient in running applications in the cloud (AWS, Azure, or equivalent) using Kubernetes and Docker.
  • Familiar with GPU programming concepts using CUDA and with popular machine learning frameworks such as PyTorch or TensorFlow.

Preferred Qualifications

  • Proficient in building and maintaining systems written in modern languages (e.g. Go, Python).
  • Familiar with fundamental deep learning architectures such as Transformer models and encoder/decoder models.
  • Familiar with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, and NVIDIA Triton Inference Server.
  • Experience writing custom GPU kernels using CUDA or OpenAI Triton.

EEO Content

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
