Associate Director, Software Engineering

12 - 17 years

40 - 50 Lacs

Posted:6 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

We are currently seeking an experienced professional to join our team in the role of Associate Director, Software Engineering
In this role you will:
Analyse incident and change data to identify patterns, root causes, and systemic risks.
Define and track service health metrics (MTTR, failure rates, change success, etc.).
Partner with product and support teams to implement reliability improvements.
Accountability for the control and compliance of the engineering process.
Promote innovation and adoption of cutting-edge specialist technologies and practices with the domain.
Build and maintain dashboards and reporting to support visibility and accountability.
Support automation and tooling initiatives to reduce toil and improve response times.
Contribute to readiness assessments for releases and major infrastructure events.
Collaborate across global teams to ensure service resilience is embedded in ways of working.
Requirements

Platform Reliability & Operations

  • Maintain and support mission-critical digital security platforms across AWS Cloud (EKS, EC2, CloudFront) and hybrid environments.
  • Implement observability using tools like CloudWatch, Splunk, Prometheus, AppDynamics (or similar).
  • Ensure 99.9%+ uptime through robust monitoring, alerting, and proactive incident management.

Incident Management & Problem Resolution

  • Lead incident response, blameless postmortems, and drive root cause analysis (RCA).
  • Collaborate with development and infrastructure teams to remediate vulnerabilities and improve system resilience. Infrastructure Automation & CI/CD
  • Automate infrastructure provisioning using Terraform, CloudFormation, and manage Kubernetes clusters on Amazon EKS.
  • Build and maintain CI/CD pipelines for secure and reliable application deployments.

Security & Compliance

  • Work closely with security teams to implement and maintain identity and access management solutions like ForgeRock and Transmit.
  • Ensure compliance with regulatory and organizational security policies.

SRE Best Practices & Reliability Engineering

  • Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs).
  • Drive capacity planning, cost optimization, chaos engineering, and disaster recovery exercises.
  • Advocate for DevSecOps and Security by Design principles.

Skills & Qualifications:

12+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles.
Strong hands-on experience with:
  • Cloud Platform AWS (EKS, EC2, CloudFront, S3, Lambda, RDS)
  • Linux, Shell/Python scripting,
  • CI/CD, Kubernetes, Docker,
  • Monitoring tools (Splunk & AppDynamics), hands-on ITIL Process
  • Infrastructure as Code (Terraform, CloudFormation)
  • ForgeRock, Transmit Security (preferred) Strong understanding of networking, security principles, and identity management. Experience with monitoring/logging (Splunk, AppDynamics, CloudWatch, Grafana, ELK). Expertise in automation and scripting (Python, Shell, or equivalent). Solid understanding of SRE concepts:
  • SLI/SLO/SLA
  • Error budgets
  • Blameless postmortems
  • Incident response

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Hsbc logo
Hsbc

Financial Services

London

RecommendedJobs for You