IBM

Senior Spark Developer

Published 02 Apr 2026
Bangalore, India
Full Time

Role Highlights

Languages used

Scala

Key skills

Machine Learning
Big Data
Computer Science
Integrations
CI/CD
Code Reviews
Automated Testing
Functional Programming
Transformation
Storage
Security
Cluster
Optimization
Cloud
NoSQL

Tools, Libraries and Frameworks

HDFS
Hive
Cassandra
Kafka
AWS
Azure
GCP
EMR
Databricks
Git
Jenkins
Apache Spark

Description

Introduction

At IBM Software, we transform client challenges into solutions, building the world's leading AI-powered, cloud-native products that shape the future of business and society. Our legacy of innovation creates endless opportunities for IBMers to learn, grow, and make an impact on a global scale. Working in Software means joining a team fueled by curiosity and collaboration. You'll work with diverse technologies, partners, and industries to design, develop, and deliver solutions that power digital transformation. With a culture that values innovation, growth, and continuous learning, IBM Software places you at the heart of IBM's product and technology landscape. Here, you'll have the tools and opportunities to advance your career while creating software that changes the world.

Your role and responsibilities

Key Responsibilities:
- Design and optimize big data applications using Apache Spark and Scala.
- Tune Spark jobs for performance and cost efficiency on distributed clusters.
- Maintain reusable libraries and ensure best coding practices.
- Work with storage systems such as HDFS, Hive, HBase, Cassandra, Kafka, and Parquet.
- Mentor junior developers and lead code reviews.
- Ensure compliance with security and governance standards.
- Troubleshoot and resolve performance issues in big data solutions.

Required technical and professional expertise

Required Qualifications:
- Bachelor's or Master's in Computer Science or a related field.
- 12+ years of software development experience.
- 5+ years of hands-on experience with Apache Spark and Scala.
- Strong knowledge of distributed computing and cluster frameworks.
- Proficiency in Scala and functional programming principles.
- Expertise in Spark tuning, partitions, joins, and optimization techniques.
- Experience with cloud platforms (AWS, Azure, GCP) and tools like EMR, Databricks, HDInsight.
- Familiarity with Kafka, Hive, HBase, NoSQL databases, and data lake architectures.
- Knowledge of CI/CD, Git, Jenkins, and automated testing.
- Strong problem-solving and collaboration skills.

Preferred technical and professional experience

Preferred:
- Experience with Databricks, Delta Lake, or Apache Iceberg.
- Exposure to machine learning pipelines using Spark MLlib or integration with ML frameworks.
- Open-source contributions in big data projects.
- Excellent communication and leadership abilities.

IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.

Required Qualifications and Skills

The role requires a Bachelor's or Master's in Computer Science or a related field. Candidates need over 12 years of software development experience, with at least 5 years specifically in Apache Spark and Scala. Strong knowledge of distributed computing and cluster frameworks is essential, along with proficiency in Scala and functional programming principles. Expertise in Spark tuning, partitions, joins, and optimization techniques is required. Experience with cloud platforms (AWS, Azure, GCP) and tools such as EMR, Databricks, or HDInsight is also necessary. Familiarity with Kafka, Hive, HBase, NoSQL databases, and data lake architectures is expected.

Disclaimer

Job and company description information and some of the data fields may have been generated via GPT-4 summarisation and could contain inaccuracies. The full external job listing link should always be relied on for authoritative information.

About the company

IBM

Size

305978

Website

ibm.com

HQ

Armonk, New York, US

Public/Private

Public Company

Description

IBM infuses core business operations with intelligence, from machine learning to generative AI, to make organizations more responsive, productive, and resilient. It helps clients put AI into action now, creating real value with trust, speed, and confidence across areas such as digital labor, IT automation, and security. The ability to use all data is critical, because AI's effectiveness depends on the quality of the data fueling it; IBM's AI and data platform aims to scale and accelerate AI's impact with trusted data. IBM's hybrid cloud platform offers a comprehensive approach to development, security, and operations across hybrid environments, laying a flexible foundation for leveraging data wherever it resides.
