Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in Bengaluru
>
Apple
>
Site Reliability Engineering Manager

Site Reliability Engineering Manager

Apple

10 years

0 Lacs

Bengaluru

Posted:1 month ago| Platform: GlassDoor logo

Apply

Skills Required

reliability engineering service diversity data support power analytics finance marketing kafka spark airflow maintenance code reporting leadership learning design drive automation tooling optimization efficiency engagement resolve escalation analysis collaboration security documentation architecture etl apache messaging aws gcp kubernetes programming python java scala management visualization tableau troubleshooting reports sap remediation

Work Mode

On-site

Job Type

Part Time

Job Description

Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It’s the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you’ll do more than join something - you’ll add something. Apple's Artificial Intelligence and Data Platforms (AiDP) team is seeking an experienced Site Reliability Engineering (SRE) Manager to support scalable and resilient distributed systems that power Apple's data pipelines and analytics platforms. Our Enterprise Data Warehouse landscape caters to a wide variety of real-time, near real-time and batch analytical solutions. These solutions are an integral part of business functions like Sales, Operations, Finance, AppleCare, Marketing and Internet Services, enabling business drivers to make critical decisions. We utilizes proprietary and open source technologies such as Kafka, Spark, Iceberg, Airflow, and others to build these solutions. If you are passionate about addressing infrastructure challenges at scale, both on-premises and in the cloud, and focused on optimizing scalable solutions by prioritizing ease of use and maintenance, you will discover exciting opportunities in AiDP.
Description
As a hands-on SRE Manager, you’ll lead by example-actively driving operational excellence, contributing to code, and ensuring system reliability. You will be deeply involved in incident response across complex, distributed data platforms designed to support data exploration, analytics, and reporting solutions. These platforms operate at the unique intersection of high data volume and hybrid infrastructure, spanning both cloud and on-premise environments. Responsibilities

Lead by Example: Provide technical leadership and guidance to SRE team by applying hands-on skills and continuous learning. Build and mentor a world-class engineering team that partners closely with platform teams to design scalable, reliable systems, while contributing actively to both platform and application code.
Drive Automation for Data Platforms and Infrastructure: Manage Infrastructure as Code (IaC) and develop tooling to enhance engineering productivity. Lead initiatives for cost optimization and operational efficiency at scale.
Incident Response and On-Call Engagement: Actively participate in on-call rotations and resolve critical production issues. Lead response efforts during major incidents and serve as the primary escalation point for complex problems.
Drive Post-Incident Analysis: Perform root cause investigations and ensure follow-up with actionable postmortems and infrastructure hardening initiatives. Implement fixes-in code, infrastructure, or processes-to prevent recurrence.
Active Collaboration with Cross-Functional Teams: Partner closely with engineering teams to troubleshoot issues, deploy fixes, and enhance system reliability. Champion operational excellence through direct technical contributions.

Minimum Qualifications

Hands-on experience supporting and maintaining applications in cloud or hybrid environments
Expertise in cloud-native services, including ETL frameworks (Apache Spark, Flink), and messaging systems (Kafka)
Strong knowledge of cloud infrastructure & services (e.g., AWS, GCP, Kubernetes), Observability tools (e.g: Prometheus, Grafana, CloudWatch)
Programming experience in Python, Java, or Scala
Proven ability to lead incident response, perform root cause analysis, and drive system reliability improvements
Bachelor’s degree or equivalent, with 10+ years of experience in the SRE domain and at least 2 years in a management role focused on leading, hiring, developing and building teams

Preferred Qualifications

Hands-on experience supporting enterprise data systems on distributed architectures
Exposure to data visualization tools such as Tableau, Business Objects, ThoughtSpot, with experience supporting and troubleshooting issues related to dashboards and reports
Experience with modern & distributed databases such as Snowflake, Cassandra, SingleStore, and SAP HANA
Experience using GenAI or automation tools for issue detection, alerting, or remediation

Submit CV

More Jobs at Apple

Global Time Away and Allowance Partner

Bengaluru, Karnataka

5 - 5 yrs

Salary: Not disclosed

Manufacturing Quality Auditor (MQA)

Chennai, Tamil Nadu

Experience: Not specified

Salary: Not disclosed

Test Automation Architect - Operations Business Process Reengineering

Bengaluru, Karnataka

15 - 15 yrs

Salary: Not disclosed

Software Engineer- AEM

Hyderabad

4 - 9 yrs

INR 40 - 45 Lacs

Fullstack Software Engineer - Global Sourcing & Supply Management

Bengaluru

7 - 12 yrs

INR 20 - 25 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Apple

Computers and Electronics Manufacturing

Cupertino California

Login to

Please Verify Your Phone or Email

Confirm Action

Site Reliability Engineering Manager