Want to professionalize your AI skills, pivot to an AI role and increase your salary?
Master AI Engineering with the most practical and comprehensive LLM Development certifications at Towards AI Academy.

Google

Tech Lead Manager, Kubernetes AI Infrastructure

Google

Published 16 May 2026
Seattle, WA, USA& Other locations
207K - 300K USD Annual
Full Time
27500 - 275000 people

Share this job

Role Highlights

Languages used

Python
C++
Java
JavaScript

Key skills

AI
Deep Learning
NLP
Information Retrieval
Machine Learning
Computer Science
UI
Distributed Systems
System Design
Technical Leadership
LLMs
PhD
Infrastructure
Cloud
Search
Security
Data
Reliability
Operations
Research

Tools, Libraries and Frameworks

Vertex AI
GCP
Kubernetes
JAX
GPUs
TPU
PyTorch
Ray

Description

The role involves designing and maintaining Kubernetes-based systems to manage large-scale TPU infrastructure for on-premises and hybrid environments. The individual will oversee a team of software engineers focused on distributed systems and AI infrastructure. Responsibilities include guiding system designs, writing development code to address ambiguous problems, and collaborating with AI labs to influence infrastructure roadmaps. The position also requires working with cross-functional partners to deploy management tools that support large-scale generative AI tasks. Additionally, the manager will contribute to product strategy and foster a collaborative team environment.

Required Qualifications and Skills

Candidates must possess a bachelor's degree or equivalent practical experience. Required experience includes eight years in software development, three years in a technical leadership role, and two years in people management. Proficiency in programming languages such as Python, C, C++, Java, or JavaScript is necessary. Preferred qualifications include a master's degree or PhD in a technical field, along with experience in distributed systems, matrixed organizations, and deep learning frameworks.

Disclaimer

Disclaimer: Job and company description information and some of the data fields may have been generated via GPT-4 summarisation and could contain inaccuracies. The full external job listing link should always be relied on for authoritative information.

About the company

Google

Size

275773

Website

goo.gle

HQ

Mountain View, US

Public/Private

Public Company

Description

At gTech's Users and Products team, innovations are focused on enhancing user engagement and solving complex customer needs through technical expertise and a deep understanding of Google's and Alphabet's broad product environments. The team acts as a bridge between Google’s users and its product teams, ensuring that user insights are integrated into product offerings which support numerous annual product launches. The newly formed Machine Learning Data Operations (MLDO) team within gUP Operations plays a critical role in delivering and tuning machine learning and GenAI data operations across Google’s product suite, leveraging extensive global vendor networks. Google's overarching goal is to create impactful products and services, with gTech playing a strategic role in bringing these solutions to fruition through technological and operational expertise.

Share

Share this job

Related jobs

Vertex AI
Computer Science
Integrations
Test Engineer
Taipei, Taiwan
Full Time
Vertex AI
Computer Science
UX
Product Design
Thailand
Remote
AI
NLP
Information Retrieval
NLU
Zurich, Switzerland
Full Time