Meta (Facebook)

About Meta (Facebook)

Connecting people through innovative technology

Key Highlights

  • Over 2.9 billion monthly active users across platforms
  • Headquartered in Menlo Park, California
  • Valued at over $800 billion
  • Significant investments in Oculus and AR/VR technology

Meta (formerly Facebook) is a leading technology company focused on building the metaverse, with over 2.9 billion monthly active users across its platforms, including Facebook, Instagram, and WhatsApp. Headquartered in Menlo Park, California, Meta has invested heavily in virtual reality and augmente...

🎁 Benefits

Meta offers competitive salaries, equity compensation, generous PTO policies, comprehensive health benefits, and a robust parental leave program. Empl...

🌟 Culture

Meta fosters a culture of innovation and experimentation, encouraging employees to take risks and explore new ideas. The company emphasizes a mission-...


AI/HPC System Performance Engineer

Meta (Facebook) • Austin, TX, Menlo Park, CA, New York, NY

Posted 2 months ago • 🏛️ On-Site • Mid-Level • AI Engineer • 📍 Austin • 📍 Menlo Park • 📍 New York

Job Description

Meta's AI Training and Inference Infrastructure is growing exponentially to support an ever-increasing number of AI use cases. This creates a dramatic scaling challenge that our engineers deal with daily. We need to build and evolve the network infrastructure that connects large numbers of training accelerators, such as GPUs. We also need to ensure that the network runs smoothly and meets the stringent performance and availability requirements of RDMA workloads, which expect a lossless fabric interconnect with minimal latency. To improve the performance of these systems, we constantly look for opportunities across the stack: network fabric and host networking, communication libraries, and scheduling infrastructure.
