Staff Software Engineer, Kubernetes Platform

Anthropic

Published 11 May 2026

Share this job

San Francisco, CA, USA

320K - 405K USD Annual

Full Time

Share this job

Role Highlights

Languages used

Python

Rust

C++

Key skills

Computer Science

API

Distributed Systems

IAC

Cloud

Research

Machine Learning

Cluster

Operators

Inference

Reliability

Batch

Infrastructure

Physics

Biology

Interpretability

Multimodal

Tools, Libraries and Frameworks

Linux Kernel

Kubernetes

DNS

GPUs

GCP

AWS

GKE

EKS

HTTP

Description

The role involves managing and extending large-scale Kubernetes clusters to support the training and serving of frontier AI models. Responsibilities include scaling the Kubernetes control plane and developing custom scheduling plugins to handle complex, topology-sensitive workloads. The position requires building and maintaining core cluster services to ensure high availability and performance under significant pressure. Additionally, the role entails collaborating with research and infrastructure teams to translate workload requirements into platform capabilities. The work focuses on maintaining system reliability and correctness as the organization's compute footprint expands.

Required Qualifications and Skills

Candidates must possess significant software engineering experience in building and operating production distributed systems. Proficiency in at least one systems-appropriate language such as Go, Python, Rust, or C++ is required, along with deep, hands-on experience with Kubernetes internals. Applicants should demonstrate an ability to debug complex issues across the stack and a track record of designing for system reliability. A bachelor's degree or an equivalent combination of education, training, and experience is the minimum educational requirement.

Apply

Visit full job listing

Disclaimer

Disclaimer: Job and company description information and some of the data fields may have been generated via GPT-4 summarisation and could contain inaccuracies. The full external job listing link should always be relied on for authoritative information.

About the company

Anthropic

Size

265

Website

anthropic.com

Public/Private

Privately Held

Description

Anthropic is an AI safety and research company focused on crafting AI systems that are reliable, interpretable, and steerable, ensuring they remain safe and beneficial for users and society. With an interdisciplinary team experienced in machine learning, physics, policy, business, and product development, Anthropic is dedicated to the mission of beneficial AI. The company emphasizes collaborative big science research, leveraging a unified team approach to focus on large-scale research efforts aimed at advancing long-term goals of creating trustworthy AI systems.

Share this job

Related jobs

Software Engineer, Systems - Claude Code

Anthropic

Computer Science

Security

Reliability

San Francisco, CA, USA

Full Time

Product Support Manager

Anthropic

Computer Science

API

New York, NY, USA

Full Time

People Research Scientist

Anthropic

Data Engineer

Machine Learning

Data Science

Computer Science

San Francisco, CA, USA

Full Time