Staff + Sr. Software Engineer, AI Reliability

Anthropic

Published 11 May 2026

Share this job

San Francisco, CA, USA

325K - 485K USD Annual

Full Time

Share this job

Role Highlights

Languages used

Key skills

Computer Science

Distributed Systems

Site Reliability

LLMs

API

Infrastructure

Cloud

SRE

Machine Learning

Research

Physics

Biology

Interpretability

Multimodal

Tools, Libraries and Frameworks

SDK

GPUs

HTTP

Description

The role involves improving reliability across critical serving paths for AI systems. The individual will develop service level objectives and design monitoring and observability systems for token paths. Responsibilities include implementing high-availability infrastructure across multiple regions and cloud providers. The position requires leading incident responses to ensure rapid recovery and systematic improvements. Additionally, the role supports the reliability of safeguard model serving to meet safety commitments.

Required Qualifications and Skills

Candidates must possess a background in distributed systems, infrastructure, or reliability. The role requires strong communication and collaboration skills to build relationships across teams. A minimum of a bachelor’s degree or an equivalent combination of education, training, and experience is required. Candidates should be comfortable managing unfamiliar systems during incidents and thinking holistically about system composition.

Apply

Visit full job listing

Disclaimer

Disclaimer: Job and company description information and some of the data fields may have been generated via GPT-4 summarisation and could contain inaccuracies. The full external job listing link should always be relied on for authoritative information.

About the company

Anthropic

Size

265

Website

anthropic.com

Public/Private

Privately Held

Description

Anthropic is an AI safety and research company focused on crafting AI systems that are reliable, interpretable, and steerable, ensuring they remain safe and beneficial for users and society. With an interdisciplinary team experienced in machine learning, physics, policy, business, and product development, Anthropic is dedicated to the mission of beneficial AI. The company emphasizes collaborative big science research, leveraging a unified team approach to focus on large-scale research efforts aimed at advancing long-term goals of creating trustworthy AI systems.

Share this job

Related jobs

Software Engineer, Systems - Claude Code

Anthropic

Computer Science

Security

Reliability

San Francisco, CA, USA

Full Time

Product Support Manager

Anthropic

Computer Science

API

New York, NY, USA

Full Time

People Research Scientist

Anthropic

Data Engineer

Machine Learning

Data Science

Computer Science

San Francisco, CA, USA

Full Time