Hiring For Site Reliability Engineer

6 - 11 years

15 - 25 Lacs

Posted: 5 hours ago | Platform: Naukri

Work Mode

Hybrid

Job Type

Full Time

Job Description

Primary Responsibilities

Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SREs ensure that managed service offerings and customer deployments have reliability and uptime appropriate to users' needs, along with a fast rate of improvement, while monitoring and validating capacity and performance. The role focuses on reliability, scalability, and the development of automation to handle repetitive tasks at scale.

Knowledge & Skills

  • In-depth knowledge of SRE practices and concepts such as SLA, SLO, SLI, error budgets, toil elimination, and postmortems.
  • Experience with any monitoring and observability tool: Grafana, Splunk, Dynatrace, GCP operations suite, etc.
  • Understanding of any APM tool such as AppDynamics or Datadog, preferably AppDynamics.
  • Experience with IaC: Terraform, Ansible, etc.
  • Experience working with cloud-native applications and managing them effectively in GCP or Azure.
  • Experience creating CI/CD pipelines with any tool such as GitHub Actions, Azure DevOps, or Jenkins.
  • Knowledge of any version control tool such as Git or Bitbucket.
  • Knowledge of any scripting language such as PowerShell, Python, or Bash.
  • Coding infrastructure automation across the CI/CD pipeline
  • Responsible for ensuring the availability, performance, and scalability of a website or application.
  • Knowledge of containerization and orchestration: Docker, Kubernetes, Cloud Run (GCP), etc.
  • Involved in capacity planning and performance tuning to ensure that the site can handle increased traffic without issue.
  • Work closely with developers to identify and fix potential issues before they cause problems for users.
  • Deep understanding of how distributed systems work in order to be able to troubleshoot and optimize them.
  • Deep understanding of how different types of databases work in order to be able to effectively troubleshoot any issues that may arise.
  • Ability to communicate clearly and concisely about system alerts or outages to other members of your team.
  • Note: Beyond this job description, the customer is looking for a candidate who can mature their SRE practice across the division, someone who is comfortable being a champion and leader in the SRE space.

sandeep.a@talent21.in

Looking for candidates from anywhere in India (PAN India) who are open to working in a hybrid model

Total experience

Relevant experience in SRE

Current CTC

Expected CTC

PAN Number

Date of Birth

Notice Period

Note: Please apply if your notice period is less than 45 days or if you are currently serving your notice period.

Talent21

Staffing and Recruitment

Human Resource City