Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in delhi
>
SITA
>
Lead Site Reliability Engineer/ Expert

Lead Site Reliability Engineer/ Expert

SITA

8 years

0 Lacs

delhi india

Posted:1 day ago| Platform:

Apply

Skills Required

reliability technology communication power transportation cutting support stability maintenance remediation resolve deployment integration automation efficiency collaboration development service provisioning auto analysis software engineering drive management effectiveness reports training risk devops automate code monitoring containerization architecture aws linux programming scripting certifications kubernetes azure certification diversity flex workday

Work Mode

Remote

Job Type

Full Time

Job Description

WELCOME TO SITA

At SITA, we keep airports moving, airlines flying smoothly, and borders open. Our technology and communication innovations power the success of the global air travel industry.

Great Place to Work®

Are you ready to love your job?

The adventure begins right here, with you, at SITA

PURPOSE

Responsible for the proactive support of products so that there is high product performance that is continuously improved. Responsible for identifying and resolving the root causes of operational incidents implementing solutions to improve stability and prevent recurrence. Manages the creation and maintenance of the event catalog to trigger events and develops both manual remediation approaches and automated workflows to resolve alerts. Oversees the deployment of IT services and solutions ensuring successful integration with minimal disruption. Focuses on operational automation and integration to enhance efficiency and collaboration between development and operations within service operations.

KEY RESPONSIBILITIES

Site Reliability Engineer

Define, build, and maintain support systems to ensure high availability and performance.
Handle complex cases for the Operations team.
Build events to add to the event catalog for the relevant product or application.
Implement automation for system provisioning, self-healing, auto recovery, deployment, and monitoring.
Perform incident response and root cause analysis for critical system failures.
Monitor system performance and establish service-level indicators (SLIs) and objectives (SLOs).
Collaborate with development and operations to integrate reliability best practices, including moving to zero downtime architecture.
Proactively identify and remediate performance issues.
Work closely with Product, Software & Infra Engineering and Service support architects for new product productization
Ensure Operations readiness to support new products
Coordinate with internal and external stakeholders for feedback for continual service improvement for in scope products & drive plan till successful closure
Accountable for the in-scope product to ensure high availability performance.

Problem Management

Conduct thorough problem investigations and root cause analyses (RCA) to diagnose recurring incidents and service disruptions
Coordinate with incident management teams,operations experts and collaborate with different Service Operations and Engineering teams to develop and implement permanent solutions.
Monitor the effectiveness of problem resolution activities, provide regular reports on problem management activities, and ensure continuous improvement.

Event Management

Define and maintain an event catalog, specifying active events, thresholds, and relevant remediation, and optimize it for efficiency.
Develop event response protocols, provide training to teams, and ensure quick and efficient handling of incidents.
Collaborate with stakeholders to define events, ensure coverage across the Service Operations, and drive improvements based on post-event reviews and feedback.

Deployment Management

Own the quality of new release deployment for the Service Operations, ensuring a clear process and responsibilities are assigned for smooth implementation.
Develop and maintain deployment schedules, conduct operational readiness assessments, and manage deployment risk assessments to ensure service stability.
Oversee the execution of deployment plans, coordinate resources & process with delivery and lifecycle engineering, communicate with stakeholders, and continuously work with different stakeholders to improve deployment processes based on feedback.

DevOps Management

Manage continuous integration and deployment (CI/CD) pipelines, ensuring smooth integration between development and operational teams.
Automate operational processes, monitor system performance, and resolve issues related to automation scripts to increase efficiency.
Implement and manage infrastructure as code, provide ongoing support for automation tools, and continuously improve DevOps practices.

EXPERIENCE

8+ years of experience in IT operations service management or infrastructure management or application management including roles such as Site Reliability Engineering lead or DevOps Engineer/lead.
Proven experience in managing high-availability systems and ensuring operational reliability.
Extensive experience in root cause analysis (RCA) incident management and developing permanent solutions for recurring service disruptions.
Extensive expertise in monitoring and observability implementation
Hands-on experience with CI/CD pipelines, automation system performance monitoring and the implementation of infrastructure as code.
Strong background in collaborating with cross-functional teams (development operations engineering etc.) to improve operational processes and service delivery.
Experience in managing deployments risk assessments and optimizing event and problem management processes.
Familiarity with cloud technologies containerization and scalable architecture including experience with zero-downtime deployment strategies.

KNOWLEDGE & SKILLS

Functional Skills:

Collaboration
Communication
Problem Solving
Incident Management
Change Management

Technical Skills:

Cloud Infrastructure (AWS, Azure)
Linux Administration
Windows Administration
Monitoring & Observability
DevOps (CI/CD)
Programming & Scripting Languages
Application Support

PROFESSION COMPETENCIES

Business Acumen
Consultancy
Financial Acumen
Info Gathering&Processing
Organisational Awareness
Quality Orientation

CORE COMPETENCIES

Collaboration
Communication
Problem Solving
Incident Management
Change Management
Innovation

EDUCATION & QUALIFICATIONS

Educational Background :

Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
Advanced degree (Master’s or equivalent) is often preferred for senior positions.

Qualifications

Relevant certifications such as Linux Administration, Certified Kubernetes Administrator (CKA)
Certifications in cloud platforms (AWS, Azure, Google Cloud) or DevOps methodologies (e.g., Certified DevOps Professional)
Certification in Windows Administration, Linux Administration

WHAT WE OFFER

We're all about diversity. We operate in 200 countries and speak 60 different languages and cultures. We're really proud of our inclusive environment. Our offices are comfortable and fun places to work, and we make sure you get to work from home too. Find out what it's like to join our team and take a step closer to your best life ever.

Flex Week:

Flex Day:

Flex-Location:

Employee Wellbeing:

Professional Development:

Competitive Benefits:

SITA is an Equal Opportunity Employer. We value a diverse workforce. In support of our Employment Equity Program, we encourage women, aboriginal people, members of visible minorities, and/or persons with disabilities to apply and self-identify in the application process.

More Jobs at SITA

Senior Analyst Business Finance

Mumbai Metropolitan Region

Experience: Not specified

Salary: Not disclosed

Associate Project Manager

Delhi, India

2 - 2 yrs

Salary: Not disclosed

Apprentice

Bihar, India

Experience: Not specified

Salary: Not disclosed

Associate Specialist Service Operations

Delhi, India

3.0 - 3.0 yrs

Salary: Not disclosed

Scrum Master

Delhi, India

3.0 - 3.0 yrs

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

SITA

Login to

Please Verify Your Phone or Email

Confirm Action

Lead Site Reliability Engineer/ Expert