Home
Jobs

Site Reliability Engineer, AVP

0 years

0 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Join our digital revolution in NatWest Digital XIn everything we do, we work to one aim. To make digital experiences which are effortless and secure.So we organise ourselves around three principles: engineer, protect, and operate. We engineer simple solutions, we protect our customers, and we operate smarter.Our people work differently depending on their jobs and needs. From hybrid working to flexible hours, we have plenty of options that help our people to thrive.This role is based in India and as such all normal working days must be carried out in India.Join us as a Site Reliability Engineer
  • You’ll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ)
  • We’ll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of CCJ across applications
  • This is a great chance to work in a supportive environment with opportunities to advance your personal and career development
  • We're offering this role at associate vice president level
What you'll do
As a Site Reliability Engineer, you’ll collaborate with feature teams to understand application changes, participate in delivery activities, and address production issues to assist in the delivery of change that does not negatively affect the customer experience. You'll contribute to site reliability operations which will include production support, incident response, on-call rota, toil reduction, and application performance. You'll also proactively lead improvement to release quality into production and provide highly available, performing, and secure production systems.Other responsibilities will include:
  • Delivering automation solutions to minimise and eliminate manual tasks associated with maintaining and supporting the applications
  • Ensuring in-depth understanding of the full tech stack on which the application resides and depends on
  • Identifying alerting and monitoring requirements for an application, based on sound understanding of customer journeys
  • Evaluating the resilience of the end-to-end tech stack on which the applications depend, and addressing weaknesses
  • Seeking to reduce frequency of hand-offs in the end-to-end resolution of customer-impacting incidents
The skills you'll need
To succeed in this role, you’ll need experience of supporting live production services serving customer journeys with a demonstrable knowledge of ITIL processes and IT Security principles along with tools and techniques to prevent compliance breaches. You'll have hands on experience with Azure Cloud and full-stack observability using tools such as Log Analytics, Application Insights, and Grafana.You’ll also need:
  • At least seven years of experience in a Site Reliability Engineer (SRE) or DevOps role
  • Strong experience with cloud platforms like AWS, Azure, Google Cloud and containerization technologies like Docker, Kubernetes
  • Experience with automation and configuration management tools like Ansible, Terraform, and Chef
  • Experience in deployment and release services, automation and troubleshooting
  • Strong verbal and written communication skills

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You