
Building safe and reliable AI systems for everyone
Anthropic, headquartered in SoMa, San Francisco, is an AI safety and research company focused on developing reliable, interpretable, and steerable AI systems. With over 1,000 employees and backed by Google, Anthropic has raised $29.3 billion in funding, including a monumental Series F round of $13 b...
Anthropic offers comprehensive health, dental, and vision insurance for employees and their dependents, along with inclusive fertility benefits via Ca...
Anthropic's culture is rooted in AI safety and reliability, with a focus on producing less harmful outputs compared to existing AI systems. The compan...

Anthropic • Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
Anthropic is seeking an AI Research Engineer for their Reward Models Platform to automate research workflows and build scalable tools for model training. This role requires collaboration with researchers and a focus on optimizing reward methodologies.
You have a strong background in AI research and engineering, with a keen understanding of the workflows involved in fine-tuning AI models. You are passionate about creating reliable and interpretable AI systems that can benefit users and society. You thrive in collaborative environments and enjoy partnering with researchers to identify and solve high-friction challenges in their workflows. You are motivated by the opportunity to make significant contributions to the development of AI technologies that are safe and beneficial.
In this role, you will work closely with the Finetuning teams to understand their research workflows and identify areas for automation. Your primary responsibility will be to build tools and infrastructure that streamline the experimentation process, reducing the time researchers spend on manual tasks. You will develop scalable platforms that allow for rapid experimentation with different reward methodologies, enabling the team to iterate quickly and effectively. Additionally, you will contribute directly to research projects, applying your expertise to enhance the development of reward models across various domains.
Anthropic offers a competitive compensation package along with benefits that support work-life balance, including generous vacation and parental leave. You will have the opportunity to work in a flexible environment, with options for remote work and travel as needed. Our office in San Francisco provides a collaborative space for you to engage with colleagues and contribute to meaningful projects that aim to advance the field of AI.
Apply now or save it for later. Get alerts for similar jobs at Anthropic.