Software Operations Engineer

4 - 8 years

0 Lacs

Posted: 2 months ago | Platform: Shine


Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview:

The Apple Customer Systems Operations team is seeking a highly skilled and motivated Operations Engineer (OPS Engineer) to join its operations team. In this role, you will be responsible for maintaining the reliability, availability, and performance of business-critical, globally distributed systems. Your primary focus will be designing and developing automation solutions that streamline system sustenance, monitoring, and operational workflows, in close collaboration with support and engineering teams.

Key Responsibilities:

- Own incident management, capacity planning, deployment safety, and operational tooling
- Combine strong software engineering skills with a passion for operational excellence
- Thrive in a fast-paced, change-driven environment focused on continuous improvement and detailed delivery
- Lead SEV1/SEV2 incident bridges, conduct blameless postmortems, and drive problem management initiatives
- Apply production support practices to large-scale, mission-critical web and iOS applications in a 24x7 onshore/offshore model
- Troubleshoot issues, analyze logs, and build metrics and operational dashboards
- Demonstrate a fundamental understanding of distributed systems (e.g., microservices, messaging brokers) and Linux operating system internals
- Lead global operations teams in large-scale enterprise environments, drawing on collaboration and leadership skills
- Use observability and monitoring tools such as Hubble, ExtraHop, Splunk, and similar platforms
- Apply a solid foundation in networking (HTTP, DNS, TCP/IP, ICMP, OSI model, subnetting, load balancing)
- Apply a hands-on engineering background in Java/JEE, REST APIs, Swift/Objective-C, databases (schema design, data access), and modern frontend technologies (React, JavaScript)
- Support scalable, event-driven architectures (Kafka or equivalent), large distributed systems, and high-availability platforms
- Demonstrate strong automation skills with Python/Linux, including CI/CD pipelines, Infrastructure as Code, Kubernetes/EKS (deployment strategies, scaling, troubleshooting), and self-healing systems
- Apply experience in AI/ML for operational automation (anomaly detection, predictive alerting, automated incident response)
- Exhibit familiarity with ITSM frameworks and enterprise support practices

Qualifications Required:

- Minimum 4 years of working experience
- Experience in incident management and troubleshooting large-scale web and iOS applications
- Strong knowledge of production support practices and distributed systems
- Familiarity with networking protocols and automation skills
- Experience with observability and monitoring tools and frameworks

Note: Please submit your CV for consideration.

Apple

Computers and Electronics Manufacturing

Cupertino, California
