
Building safe and reliable AI systems for everyone
Anthropic, headquartered in SoMa, San Francisco, is an AI safety and research company focused on developing reliable, interpretable, and steerable AI systems. With over 1,000 employees and backed by Google, Anthropic has raised $29.3 billion in funding, including a monumental Series F round of $13 b...
Anthropic offers comprehensive health, dental, and vision insurance for employees and their dependents, along with inclusive fertility benefits via Ca...
Anthropic's culture is rooted in AI safety and reliability, with a focus on producing less harmful outputs compared to existing AI systems. The compan...

Anthropic • Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
Anthropic is seeking an AI Research Engineer to develop next-generation training environments for agentic AI systems. You'll work on reinforcement learning and collaborate across teams to push the boundaries of AI capabilities. This role requires a blend of research and engineering skills.
You have a strong background in AI and machine learning, particularly in reinforcement learning — you've contributed to research that advances the state of the art and understand the complexities of training AI models in challenging environments. You possess a blend of research and engineering skills, allowing you to implement novel approaches while also guiding research direction. Your experience includes designing training environments that enable AI to navigate ambiguity and exercise judgment in open-ended scenarios. You thrive in collaborative settings, working closely with researchers and engineers to ship environments into production training. You are comfortable debugging and iterating across research and production ML stacks, ensuring that the systems you build are robust and effective. You are passionate about creating safe and beneficial AI systems that can positively impact society.
In this role, you will be responsible for building the next generation of agentic environments that challenge AI models to perform complex tasks. You will design rigorous evaluations that measure real capability, ensuring that the AI systems you develop are both effective and safe. Collaboration is key, as you will work across research and infrastructure teams to ship environments into production training. You will debug and iterate rapidly, adapting your approaches based on feedback and results from the training processes. Your contributions will help shape the research culture at Anthropic, fostering an environment where innovative ideas can flourish and lead to impactful advancements in AI.
Anthropic is committed to creating a supportive and flexible work environment. We offer competitive compensation and benefits, including optional equity donation matching and generous vacation and parental leave. Our team enjoys flexible working hours and a collaborative office space in San Francisco, where you can engage with colleagues and contribute to meaningful projects. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds in our mission to build beneficial AI systems.
Apply now or save it for later. Get alerts for similar jobs at Anthropic.