Visa

Staff Site Reliability Engineer - Linux, Containers, Kbs, Automation, GenAI

Published 06 Apr 2026
Bangalore, India
Full Time

Role Highlights

Languages used

Python
Java
C#
SQL

Key skills

Machine Learning
Anomaly Detection
CICD
Product Development
SME
SRE
Site Reliability
API
UI
TroubleShooting
Operations
Architecture
Middleware
Logging
AI
Containers
Automation

Tools, Libraries and Frameworks

PowerShell
Mongo
Jenkins
Chef
Kafka
Kubernetes
Splunk
Prometheus
Grafana
Linux
ITIL

Description

Job Family Group: Engineering and Technology

Company Description

Visa is a world leader in payments technology, facilitating transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories, dedicated to uplifting everyone, everywhere by being the best way to pay and be paid. At Visa, you'll have the opportunity to create impact at scale: tackling meaningful challenges, growing your skills and seeing your contributions impact lives around the world. Join Visa and do work that matters to you, to your community, and to the world. Progress starts with you.

Job Description

We seek an experienced IT professional to join us as a Staff Site Reliability Engineer in the Product Reliability Engineering function, who will:

Perform day-to-day site reliability engineering functions, including maintenance and incident resolution, for all Debit applications, products, and services across the debit, prepaid and risk lines of business.
Perform ongoing, proactive analysis of debit authorization, API and UI based applications to detect potential problems, and actively engage in and facilitate discussion to find the best possible solution.
Work under the guidance of technical subject matter experts and be the point of contact for key DPS projects.
Work closely with service partners such as product development and engineering teams to seamlessly implement innovative solutions that improve reliability, scalability, and efficiency.
Contribute towards automating routine tasks and processes to improve overall efficiency and reduce human error.
Actively participate in troubleshooting activities and SWAT calls, and drive investigations towards swift resolution.
Participate in Major Problem Review discussions, drive root cause analysis, identify gaps, and come up with innovative preventive measures.
Mentor junior team members and foster a culture of continuous improvement in the team through retrospectives and open feedback.
Build comprehensive and robust documentation repositories that facilitate knowledge transfer among DPS PRE and DPS Global Operations peers.
Implement innovative GenAI and machine learning trends to continuously optimize application reliability and efficiency.
Work with the observability team to design and implement modern Visa observability solutions such as Anomaly Detection, Operations Intelligent Platform (OIP), and Fault Isolation Tool (FIT) across all DPS products.
Provide on-call support in a 12/7 model.
Be self-motivated, with excellent interpersonal and communication skills.

This is a hybrid position. Expectation of days in the office will be confirmed by your Hiring Manager.

Qualifications

Preferred Qualifications:

8+ years of relevant work experience with a Bachelor's degree, or at least 4 years of work experience with an Advanced degree (e.g. Masters, MBA, JD, MD), or 2 years of work experience with a PhD, OR 8+ years of relevant work experience.
5 years of experience and proficiency in one or more programming languages such as Python, Java, .NET, C#, PowerShell, Bash scripting.
3 or more years of experience leading projects and key technical initiatives. This role requires a high level of technical expertise, leadership skills, and a strong understanding of site reliability engineering principles and practices.
5 years of experience and advanced proficiency in writing complex queries and working with SQL and MongoDB databases.
Prior experience working on CI/CD pipelines and tools like Jenkins, Chef etc.
Prior experience partnering with product development teams and evaluating application design for optimal reliability and resiliency.
Prior experience with, and strong understanding of, networking concepts, protocols, and architecture.
Advanced working knowledge of ITIL concepts and processes such as incident/change/problem management, call triaging, and escalation procedures.
Prior experience with middleware components such as Kafka, Hazelcast, Qlik etc.
Advanced proficiency and experience with container orchestration systems, particularly Kubernetes.
Experience with advanced monitoring, logging, and tracing tools such as Splunk, Prometheus, Grafana, Riverbed etc., for troubleshooting and performance tuning.
Basic understanding of AI frameworks and libraries to further enhance application resiliency and day-to-day operational tasks.
Prior experience building tools to automate production support activities that enable efficiency and productivity of all operations groups.
Prior experience working in a shift model in 24/7 environments.
Candidate should be comfortable communicating with technical and non-technical peer groups, including Account Management, Client Services, and other technical platform and application support groups.
Strong work ethic, self-starter, ability to work in a fast-paced, team-oriented environment.

Additional Information

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.
Posted by Sairam Bellamkonda

Required Qualifications and Skills

The role requires a high level of technical expertise, leadership skills, and a strong understanding of site reliability engineering principles and practices. Candidates should have 8+ years of relevant work experience with a Bachelor's degree, or at least 4 years of work experience with an Advanced degree, or 2 years of work experience with a PhD. Alternatively, 8+ years of relevant work experience is acceptable. Five years of experience and proficiency in one or more programming languages such as Python, Java, .NET, C#, PowerShell, or Bash scripting is required. Three or more years of experience leading projects and key technical initiatives is also necessary. Advanced proficiency and experience with container orchestration systems, particularly Kubernetes, is essential. Experience with advanced monitoring, logging, and tracing tools such as Splunk, Prometheus, Grafana, and Riverbed is required for troubleshooting and performance tuning. A basic understanding of AI frameworks and libraries is also expected, to enhance application resiliency and day-to-day operational tasks.

Disclaimer

Disclaimer: Job and company description information and some of the data fields may have been generated via GPT-4 summarisation and could contain inaccuracies. The full external job listing link should always be relied on for authoritative information.

About the company

Visa

Size

25017

Website

visa.com

HQ

Foster City, US

Public/Private

Public Company

Description

Visa is a world leader in digital payments, facilitating more than 215 billion payment transactions between various parties across over 200 countries and territories each year. Their mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive. By including everyone everywhere, Visa believes in uplifting everyone everywhere. Working with Visa means contributing to a culture that embraces identity and purpose, ensuring that the work has a direct impact on billions of people globally by unlocking financial access and enabling the future of money movement.
