Want to professionalize your AI skills, pivot to an AI role and increase your salary?
Master AI Engineering with the most practical and comprehensive LLM Development certifications at Towards AI Academy.

Anthropic

Software Engineer, Safeguards Infrastructure

Anthropic

Published 11 May 2026
London, UK
255K - 325K GBP Annual
Full Time

Share this job

Role Highlights

Languages used

TypeScript

Key skills

Computer Science
Trust & Safety
AI
Infrastructure
Data
Storage
Machine Learning
Research
Physics
Biology
Interpretability
Multimodal

Tools, Libraries and Frameworks

HTTP

Description

The role involves developing foundational infrastructure for safety, oversight, and intervention mechanisms within AI systems. Engineers will build systems designed to detect unwanted model behaviors and prevent disallowed use. The work includes creating data storage, management, metric, and evaluation systems, as well as tooling for human and agentic review. Responsibilities also include maintaining operational systems to ensure safety and user well-being while reducing the need for manual intervention. The position requires building multi-layered defenses that function effectively at scale.

Required Qualifications and Skills

Candidates are required to hold a Bachelor’s degree in Computer Science, Software Engineering, or possess comparable experience. The role necessitates four to ten years of professional software engineering experience. Proficiency in Python and the ability to work across the stack are essential requirements. Additionally, candidates must demonstrate strong communication skills to effectively explain complex technical concepts to non-technical stakeholders.

Disclaimer

Disclaimer: Job and company description information and some of the data fields may have been generated via GPT-4 summarisation and could contain inaccuracies. The full external job listing link should always be relied on for authoritative information.

About the company

Anthropic

Size

265

Public/Private

Privately Held

Description

Anthropic is an AI safety and research company focused on crafting AI systems that are reliable, interpretable, and steerable, ensuring they remain safe and beneficial for users and society. With an interdisciplinary team experienced in machine learning, physics, policy, business, and product development, Anthropic is dedicated to the mission of beneficial AI. The company emphasizes collaborative big science research, leveraging a unified team approach to focus on large-scale research efforts aimed at advancing long-term goals of creating trustworthy AI systems.

Share

Share this job

Related jobs

Computer Science
Program Management
AI
Data
San Francisco, CA, USA
Full Time
Computer Science
SME
AI
Senior
Sydney, Australia
Full Time
Computer Science
Trust & Safety
LLMs
AI
New York City, NY
Full Time