LeethubLeethub
JobsCompaniesBlog
Go to dashboard

Leethub

Curated tech jobs from FAANG and top companies worldwide.

Top Companies

  • Google Jobs
  • Meta Jobs
  • Amazon Jobs
  • Apple Jobs
  • Netflix Jobs
  • All Companies →

Job Categories

  • Software Engineering
  • Data, AI & Machine Learning
  • Product Management
  • Design & User Experience
  • Operations & Strategy
  • Remote Jobs
  • All Categories →

Browse by Type

  • Remote Jobs
  • Hybrid Jobs
  • Senior Positions
  • Entry Level
  • All Jobs →

Resources

  • Google Interview Guide
  • Salary Guide 2025
  • Salary Negotiation
  • LeetCode Study Plan
  • All Articles →

Company

  • Dashboard
  • Privacy Policy
  • Contact Us
© 2026 Leethub LLC. All rights reserved.
Home›Jobs›Reflection›Member of Technical Staff - Alignment Lead
Reflection

About Reflection

Unlocking knowledge with AI for smarter organizations

🏢 Tech👥 11-50📍 Brooklyn, New York, United States

Key Highlights

  • Headquartered in Brooklyn, New York
  • AI-powered platform using natural language processing
  • Focused on eliminating information silos
  • Team size of 11-50 employees

ReflectionAI, headquartered in Brooklyn, New York, provides an AI-driven knowledge management platform that leverages natural language processing to transform unstructured information from meetings, documents, and conversations into a searchable knowledge base. With a focus on enhancing productivity...

🎁 Benefits

Employees at ReflectionAI enjoy competitive salaries, equity options, flexible remote work policies, and generous PTO to maintain a healthy work-life ...

🌟 Culture

ReflectionAI fosters a culture of innovation and collaboration, encouraging employees to contribute ideas and solutions while prioritizing work-life b...

🌐 Website💼 LinkedIn𝕏 TwitterAll 27 jobs →
Reflection

Member of Technical Staff - Alignment Lead

Reflection • SF

Posted 8h ago🏛️ On-SiteLeadAi research engineer📍 San francisco
Apply Now →

Skills & Technologies

Machine learningPythonTensorFlowPyTorchReinforcement learning

Overview

Reflection is seeking a Lead AI Research Engineer to drive the alignment stack for their AI models. You'll work with methodologies like RLHF and RLAIF, focusing on improving model performance. This role requires a graduate degree in Computer Science or related fields and deep technical expertise in alignment methodologies.

Job Description

Who you are

You hold a graduate degree (MS or PhD) in Computer Science, Machine Learning, or a related discipline, and possess a deep technical command of alignment methodologies such as PPO, DPO, and rejection sampling. Your experience includes scaling these methodologies to large models, showcasing your strong engineering skills and comfort with complex ML codebases and distributed systems.

You have a proven track record of improving model behavior through data, reward modeling, or reinforcement learning techniques. Your background includes owning ambitious research or engineering agendas that led to measurable improvements in model performance. You thrive in collaborative environments, working closely with cross-functional teams to achieve shared goals.

Desirable

Experience with synthetic data pipelines and optimizing large-scale RL pipelines for stability and efficiency would be a plus. Familiarity with curating high-quality training data and designing feedback loops that translate alignment research into generalizable model gains is also desirable.

What you'll do

In this role, you will drive the entire alignment stack, focusing on instruction tuning, RLHF, and RLAIF to enhance model accuracy and instruction following. You will lead research efforts to design next-generation reward models and optimization objectives that significantly improve human preference performance. Your responsibilities will include curating high-quality training data and designing synthetic data pipelines to address complex reasoning and behavioral gaps.

You will optimize large-scale reinforcement learning pipelines for stability and efficiency, ensuring rapid iteration cycles for model improvements. Collaboration will be key as you work closely with pre-training and evaluation teams to create tight feedback loops that translate alignment research into generalizable model gains. Your leadership will guide the team in pushing the boundaries of AI alignment methodologies.

What we offer

Reflection offers a supportive work environment with a mission to build open superintelligence accessible to all. We provide fully paid parental leave for all new parents, including adoptive and surrogate journeys, along with financial support for family planning. Our benefits include paid time off when needed, relocation support, and daily lunch and dinner provided for all employees. We also host regular off-sites and team celebrations to foster connections among teammates.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Reflection.

Apply Now →Get Job Alerts