Service Reliability Infra Analyst

13 - 17 years

0 Lacs

Posted:16 hours ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Service Reliability Advisor at Arm, you will play a crucial role in improving the availability and performance of Arm infrastructure by leveraging Arms AI Operations (AIOPS) and observability platforms. Your responsibilities will include: - Acting as the first line of response for all events and incidents related to the infrastructure, ensuring timely mitigation of service impact - Proactively managing core server, storage, identity, and engineering platforms - Assisting in onboarding new infrastructure and monitoring capabilities into the observability stack - Supporting and enhancing alerting and incident response workflows - Automating repetitive tasks to enhance efficiency and minimize manual intervention - Conducting root cause analysis of incidents and implementing preventive measures - Managing incidents to suppliers and participating in technical on-call rotas as needed - Logging all issues in the Service Management Tool and ensuring timely resolution within EIT service levels - Working on a shift pattern in a 24/7/365 operating model, demonstrating flexibility and independence during emergencies or critical issues Required Skills And Experience: - Experience supporting infrastructure across cloud and on-prem solutions in areas like server, storage, virtualization, or engineering platforms - 13 years of hands-on experience in Platform Operations or Infrastructure Support roles - Proficiency in observability tools for real-time monitoring, alerting, and diagnostics - Familiarity with scripting or programming languages such as Python, Java, .NET, Node.js, Ansible, or JavaScript - Understanding of UAM and IAM across on-Premise OUD LDAP and Azure AD - Experience with Windows and Linux operating systems, as well as engineering tools like Github, Jira, and Confluence - Ability to adapt to new skills and technologies as the scope of responsibility expands - Strong communication skills, proactive approach, and personal accountability for outcomes - Proficiency in incident analysis and recommendation of reliability improvements - Experience in ticket management via an ITSM platform such as ServiceNow Nice To Have Skills And Experience: - Exposure to high-performance computing or cloud-native services - Experience in creating or managing Ansible playbooks for repetitive tasks or configuration - Interest in automation and DevOps practices If you require any adjustments or accommodations during the recruitment process, please reach out to accommodations@arm.com. Arm is committed to providing equal opportunities to all candidates. Arm's hybrid working approach offers flexibility to employees, empowering them to determine their own hybrid working patterns based on work and team needs. Specific details regarding hybrid working for each role will be discussed during the application process. Arm values both high performance and personal wellbeing in its hybrid working model.,

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You