
Building safe and reliable AI systems for everyone
Anthropic, headquartered in SoMa, San Francisco, is an AI safety and research company focused on developing reliable, interpretable, and steerable AI systems. With over 1,000 employees and backed by Google, Anthropic has raised $29.3 billion in funding, including a monumental Series F round of $13 b...
Anthropic offers comprehensive health, dental, and vision insurance for employees and their dependents, along with inclusive fertility benefits via Ca...
Anthropic's culture is rooted in AI safety and reliability, with a focus on producing less harmful outputs compared to existing AI systems. The compan...

Anthropic • Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC
Anthropic is seeking a Technical Policy Manager specializing in Cyber Harms to lead efforts in preventing AI misuse in the cybersecurity domain. You'll work closely with research engineers to design safety systems and inform actionable policies. This role requires deep technical expertise in cybersecurity.
You are a cybersecurity expert with a strong background in technical policy management — you have experience leading teams focused on cyber threat modeling and evaluation frameworks. Your expertise allows you to translate complex cyber threat concepts into actionable policies and technical safeguards. You understand the balance between advancing legitimate security research and preventing misuse by malicious actors. You are committed to creating reliable and interpretable AI systems that are safe and beneficial for users and society. You thrive in collaborative environments, working closely with researchers, engineers, and policy experts to shape responsible AI safety in the cybersecurity domain.
Experience in leading cross-functional teams and a deep understanding of AI safety principles would be advantageous. Familiarity with the latest cybersecurity threats and trends will help you excel in this role. You are comfortable navigating the intersection of technology and policy, ensuring that safety systems are effective against real-world threats.
In this role, you will lead a team of technical specialists dedicated to preventing AI misuse in the cyber domain. You will oversee the design and execution of safety systems that detect harmful cyber behaviors, ensuring that your team's cybersecurity domain knowledge is effectively applied. You will collaborate with research engineers to inform the design of these systems, providing critical insights that enhance their effectiveness against sophisticated threat actors. Your leadership will be pivotal in defining what responsible AI safety looks like in the cybersecurity landscape. You will engage with various stakeholders to translate complex cyber threat concepts into concrete technical safeguards and actionable policies. This role offers a unique opportunity to shape the future of AI in cybersecurity, balancing innovation with safety.
Anthropic provides a supportive work environment that encourages collaboration and innovation. We offer competitive compensation and benefits, including optional equity donation matching, generous vacation, and parental leave. Our flexible working hours allow you to maintain a healthy work-life balance. You will have the opportunity to work in a lovely office space in San Francisco, collaborating with a diverse team of committed professionals. We are dedicated to creating a workplace that values your contributions and supports your professional growth.
Apply now or save it for later. Get alerts for similar jobs at Anthropic.