Lead Site Reliability Engineer

10 - 13 years

0 Lacs

Posted: -1 days ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job Purpose

  • Analysing, troubleshooting, and designing vital services, platforms, and infrastructure while always thinking about reliability, scalability, resilience, security, and performance.
  • Lead a team of SRE engineers

Role & responsibilities

  • Help build a Site Reliability Engineering culture by sharing best practices, approaches, documentation, and code with other engineering teams
  • Apply automation and software to any tasks or parts of the system that are performed manually
  • Troubleshoot complex cross-platform issues spanning OS, networking, and databases in a cloud-based SaaS environment, and handle live production incidents
  • Monitor application performance, take steps to improve overall application performance and stability, and follow through with implementation
  • Conduct system analysis and configuration management, and develop improvements for system software performance, availability, and reliability
  • Design, write, ship, and motivate the creation of software and systems to increase observability, product reliability, and organizational efficiency
  • Maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure
  • Develop runbooks and standard operating procedures for recurring production issues, while also working on permanent fixes
  • Perform incident analysis regularly, with the aim of preventing incidents and finding long-term fixes for them

Skills:

  • Experience in monitoring and analyzing infrastructure performance using standard performance monitoring tools
  • Demonstrable experience in containerization (Docker) and orchestration (Kubernetes)
  • Experience with Infrastructure as Code (Terraform, CloudFormation, Ansible)
  • Knowledge of and proven hands-on experience with large-scale databases and distributed technologies, such as Kafka and Confluent Platform
  • Basic programming and scripting skills

HDFC Bank

Banking

Mumbai, Maharashtra
