Want to professionalize your AI skills, pivot to an AI role and increase your salary?
Master AI Engineering with the most practical and comprehensive LLM Development certifications at Towards AI Academy.

Anthropic

Staff Software Engineer, AI Reliability Engineering

Anthropic

Published 11 May 2026
London, UK
325K - 390K GBP Annual
Full Time

Share this job

Role Highlights

Languages used

Key skills

Computer Science
Distributed Systems
Site Reliability
LLMs
AI
API
Infrastructure
Cloud
SRE
Machine Learning
Testing
Research
Physics
Biology
Interpretability
Multimodal

Tools, Libraries and Frameworks

SDK
GPUs
HTTP

Description

The role involves improving the reliability of critical serving paths for AI systems. Responsibilities include developing service level objectives for large language model serving systems while balancing availability and latency. The position requires designing and implementing monitoring and observability systems across the token path. The individual will assist in creating high-availability serving infrastructure across multiple regions and cloud providers. Additionally, the role entails leading incident response efforts to ensure rapid recovery and systematic improvements.

Required Qualifications and Skills

Candidates must possess a background in distributed systems, infrastructure, or reliability. The role requires strong communication and collaboration skills to build relationships across teams. A bachelor's degree or an equivalent combination of education, training, and experience in a relevant field is required. Applicants should demonstrate a holistic approach to system composition and a willingness to engage with unfamiliar systems during incidents.

Disclaimer

Disclaimer: Job and company description information and some of the data fields may have been generated via GPT-4 summarisation and could contain inaccuracies. The full external job listing link should always be relied on for authoritative information.

About the company

Anthropic

Size

265

Public/Private

Privately Held

Description

Anthropic is an AI safety and research company focused on crafting AI systems that are reliable, interpretable, and steerable, ensuring they remain safe and beneficial for users and society. With an interdisciplinary team experienced in machine learning, physics, policy, business, and product development, Anthropic is dedicated to the mission of beneficial AI. The company emphasizes collaborative big science research, leveraging a unified team approach to focus on large-scale research efforts aimed at advancing long-term goals of creating trustworthy AI systems.

Share

Share this job

Related jobs

Computer Science
Program Management
AI
Data
San Francisco, CA, USA
Full Time
Computer Science
SME
AI
Senior
Sydney, Australia
Full Time
Computer Science
Trust & Safety
LLMs
AI
New York City, NY
Full Time