Senior Engineering Manager, SRE (Loyalty)

6 - 10 years

0 Lacs

Posted:1 day ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview: As a Senior Engineering Manager Site Reliability Engineering (SRE) at Marriott Tech Accelerator, you will be responsible for ensuring the reliability, scalability, and performance of mission-critical cloud and on-prem services that cater to millions of Marriott customers worldwide. Your role will involve overseeing incident management, driving automation efforts, and collaborating with cross-functional teams to align SRE strategy with business objectives. It is essential to possess strong communication skills to optimize cloud infrastructure and maintain operational excellence in a dynamic environment. Key Responsibilities: - Ensure the reliability, availability, and performance of mission-critical cloud services by implementing best practices for monitoring, alerting, and incident management. - Oversee the management of high-severity incidents, facilitating quick resolution and conducting post-incident analysis to prevent recurrence. - Drive the automation of operational processes to support growing user demand effectively, optimizing cloud and on-prem infrastructure and resource usage. - Develop and execute the SRE strategy aligned with business goals, and communicate service health, reliability, and performance metrics to senior leadership and stakeholders. Qualifications Required: - 8-10 years of experience in information technology process and/or technical project management, including 4+ years as a Site Reliability Engineer (SRE) with 2+ years on public cloud, preferably AWS. - Proven automation and programming experience in languages like Java, Python, Go, Perl, and Bash. - Deep understanding of SRE practices such as Service Level Objectives, Error Budgets, Toil Management, Observability & Monitoring, Blameless Postmortems, Incident Response Process, and Capacity Planning. - Strong working knowledge of modern, continuous development techniques and pipelines (Agile, Kanban, Jira, CI/CD, Jenkins, Git, Artifactory). - Production level expertise with containerization orchestration engines such as Kubernetes. - Experience with deploying, monitoring, and troubleshooting large-scale, distributed applications in cloud environments like AWS. - Familiarity with security frameworks such as ISO27001, SOCII, PCI-DSS, and/or HIPAA. - Ability to work with global teams located in the US and India. - Undergraduate degree in Computer Science or related technical field or equivalent experience/certification. Additional Company Details (if any): Marriott Tech Accelerator is a part of Marriott International, a global leader in hospitality with over 30 well-known brands and nearly 8,900 properties in 141 countries and territories. As a leading American multinational company, Marriott International operates a wide range of lodging brands, including hotels and residential properties. The work location for this position is in Hyderabad, India, with a hybrid work mode. Role Overview: As a Senior Engineering Manager Site Reliability Engineering (SRE) at Marriott Tech Accelerator, you will be responsible for ensuring the reliability, scalability, and performance of mission-critical cloud and on-prem services that cater to millions of Marriott customers worldwide. Your role will involve overseeing incident management, driving automation efforts, and collaborating with cross-functional teams to align SRE strategy with business objectives. It is essential to possess strong communication skills to optimize cloud infrastructure and maintain operational excellence in a dynamic environment. Key Responsibilities: - Ensure the reliability, availability, and performance of mission-critical cloud services by implementing best practices for monitoring, alerting, and incident management. - Oversee the management of high-severity incidents, facilitating quick resolution and conducting post-incident analysis to prevent recurrence. - Drive the automation of operational processes to support growing user demand effectively, optimizing cloud and on-prem infrastructure and resource usage. - Develop and execute the SRE strategy aligned with business goals, and communicate service health, reliability, and performance metrics to senior leadership and stakeholders. Qualifications Required: - 8-10 years of experience in information technology process and/or technical project management, including 4+ years as a Site Reliability Engineer (SRE) with 2+ years on public cloud, preferably AWS. - Proven automation and programming experience in languages like Java, Python, Go, Perl, and Bash. - Deep understanding of SRE practices such as Service Level Objectives, Error Budgets, Toil Management, Observability & Monitoring, Blameless Postmortems, Incident Response Process, and Capacity Planning. - Strong working knowledge of modern, continuous development techniques and pipelines (Agile, Kanban, Jira, CI/CD, Jenkins, Git, Artifactory). - Production level expertise with containerization orchestration engines such as Kubernetes. - Experience wi

Mock Interview

Practice Video Interview with JobPe AI

Start Java Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Java Skills

Practice Java coding challenges to boost your skills

Start Practicing Java Now
ANSR logo
ANSR

Computers and Electronics Manufacturing

Austin

RecommendedJobs for You