Want to professionalize your AI skills, pivot to an AI role and increase your salary?
Master AI Engineering with the most practical and comprehensive LLM Development certifications at Towards AI Academy.

Palo Alto Networks

Senior Site Reliability Engineer (NetSec)

Palo Alto Networks

Published 23 Dec 2025
Tel Aviv, Israel
Remote

Share this job

Role Highlights

Languages used

Python
Java

Key skills

Computer Science
CICD
Distributed Systems
IAC
Cloud Environments
Site Reliability
Network Security
Data
SRE
Infrastructure
Automation
Architecture
Storage
Kernel
Microservices
Routing
AI
LLMs
Machine Learning

Tools, Libraries and Frameworks

Unix
Linux
Shell
Kubernetes
AWS
GCP
Azure
Terraform
Chef
Puppet
DNS
Ansible

Description

\\\\Our Mission\\\\ At Palo Alto Networks® everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and were looking for innovators who are as committed to shaping the future of cybersecurity as we are. \\\\Who We Are\\\\ This role is remote, but distance is no barrier to impact. Our hybrid teams collaborate across geographies to solve big problems, stay close to our customers, and grow together. You will be part of a culture that values trust, accountability, and shared success where your work truly matters. \\\\Your Career\\\\ The SASE Platform team builds and operates highly available, secure, and globally distributed services that protect users, applications, and data for some of the worlds largest enterprises. Our mission is to deliver cloud-native security and networking capabilities that seamlessly converge networking and security at scale. As enterprises accelerate adoption of cloud, remote work, and AI-driven workloads, the need for resilient, observable, and secure SASE platforms has never been greater. As an SRE, you will play a critical role in ensuring our platform is reliable, scalable, performant, and secure from day one. \\\\Your Impact\\\\ As a Site Reliability Engineer, you will be an integral part of the product and platform lifecycle, partnering closely with software engineers, security experts, and infrastructure teams. You will: \\+ Collaborate with development teams to embed reliability, scalability, and operability into services from the earliest design stages \\+ Design, review, and evolve cloud-native architectures to improve availability, performance, cost efficiency, and fault tolerance \\+ Build and operate automation for provisioning, deploying, and managing infrastructure at global scale using Infrastructure as Code \\+ Improve CI/CD pipelines and release processes to enable safe, fast, and repeatable deployments \\+ Drive observability best practices, including metrics, logs, traces, SLIs/SLOs, and data-driven incident analysis \\+ Participate in on-call rotations, continuously reducing MTTR through automation, runbooks, and proactive reliability improvements \\+ Mentor and guide engineers on large-scale cloud and SASE deployments, fostering a strong SRE culture \\+ Participate in architecture and design reviews, bringing a reliability and operational excellence mindset \\+ Champion reliability, security, and operational maturity across the organization \\\\Your Experience\\\\ \\+ Bachelors degree in Engineering, Computer Science, or a related technical field (or equivalent practical experience) \\+ 5+ years of experience working with Unix/Linux systems (shell, tools, networking, storage, kernel concepts) \\+ 2+ years of hands-on experience with microservices architectures running on Kubernetes and container platforms \\+ Strong understanding of distributed systems design, fault tolerance, scalability patterns, and high-availability architectures \\+ Experience operating workloads in public cloud environments (AWS, GCP, Azure, or hybrid) at medium to large scale \\+ Proficiency in building automation and tools in Python, Java, or similar languages for production environments \\+ Strong Infrastructure as Code experience (Terraform, Ansible, Chef, Puppet, or similar) \\+ Experience designing and operating monitoring, alerting, and observability systems at scale \\+ A tools-first mindset with a passion for reducing toil and increasing engineering efficiency \\+ Excellent communication skills and the ability to lead discussions across engineering and security teams \\+ Experience applying reliability and security frameworks to design, review, and operate production systems Nice to have: \\+ Networking expertise, including TCP/IP, DNS, BGP, routing, load balancing, proxies, VPNs, and cloud networking conceptsespecially relevant to SASE architectures \\+ Experience operating or supporting SASE, SD-WAN, Zero Trust, or network security platforms \\+ Familiarity with AI/LLM technologies, including: \\+ Using LLMs to improve operational workflows (incident analysis, alert enrichment, runbooks, automation) \\+ Experience integrating AI/ML services into production systems \\+ Understanding of reliability, security, and governance considerations for AI-driven services \\\\The Team\\\\ Our engineering organization is at the heart of our mission to deliver secure, reliable digital experiences and prevent cyberattacks. We dont just follow industry trendswe help define them. Our teams thrive in ambiguity, embrace complex challenges, and are motivated by building platforms that operate at global scale with uncompromising reliability and security. \\\\Our Commitment\\\\ Were problem solvers that take risks and challenge cybersecuritys status quo. Its simple: we cant accomplish our mission without diverse teams innovating, together. We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at . Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics. All your information will be kept confidential according to EEO guidelines.

Required Qualifications and Skills

The role requires a Bachelors degree in Engineering, Computer Science, or a related technical field, or equivalent practical experience. Candidates must have over five years of experience with Unix/Linux systems, including shell, tools, networking, storage, and kernel concepts. Additionally, two or more years of hands-on experience with microservices architectures running on Kubernetes and container platforms is necessary. A strong understanding of distributed systems design, fault tolerance, scalability patterns, and high-availability architectures is essential. Experience operating workloads in public cloud environments like AWS, GCP, or Azure at medium to large scale is required, along with proficiency in building automation and tools in Python, Java, or similar languages for production environments. Strong Infrastructure as Code experience with tools such as Terraform, Ansible, Chef, or Puppet is also a requirement. Experience designing and operating monitoring, alerting, and observability systems at scale is expected, along with a tools-first mindset focused on reducing toil and increasing engineering efficiency. Excellent communication skills and the ability to lead discussions across engineering and security teams are important. Experience applying reliability and security frameworks to design, review, and operate production systems is also noted.

Disclaimer

Disclaimer: Job and company description information and some of the data fields may have been generated via GPT-4 summarisation and could contain inaccuracies. The full external job listing link should always be relied on for authoritative information.

About the company

Palo Alto Networks

Size

14705

Founded

HQ

SANTA CLARA, US

Public/Private

Public Company

Description

Palo Alto Networks, the global cybersecurity leader, is shaping the cloud-centric future with technology that is transforming the way people and organizations operate. Our mission is to be the cybersecurity partner of choice, protecting our digital way of life. We help address the world's greatest security challenges with continuous innovation that seizes the latest breakthroughs in artificial intelligence, analytics, automation, and orchestration. By delivering an integrated platform and empowering a growing ecosystem of partners, we are at the forefront of protecting tens of thousands of organizations across clouds, networks, and mobile devices. Our vision is a world where each day is safer and more secure than the one before. For more information, visit www.paloaltonetworks.com.

Share

Share this job

Related jobs

Data Lakes
Computer Science
Integrations
API
Santa Clara, CA, USA
Full Time
Palo Alto Networks

AI Financial Analyst

Palo Alto Networks

Scikit-learn
AI
Data Governance
Machine Learning
Santa Clara, CA, USA
Full Time
Internship
Integrations
Product Management
CICD
Distributed Systems
Santa Clara, CA, USA
Full Time