Senior Site Reliability Engineer

4 years

13 Lacs

Posted: 19 hours ago | Platform: Glassdoor

Work Mode

On-site

Job Type

Full Time

Job Description

Key Responsibilities

  • Design, implement, and maintain comprehensive monitoring, logging, and alerting solutions across our production and other environments
  • Lead incident response and post-mortem analyses, establishing best practices for problem resolution
  • Design and implement disaster recovery strategies and ensure regular testing
  • Collaborate with development teams and other stakeholders to implement SLAs for critical services
  • Optimize cloud infrastructure for performance, reliability, and cost-efficiency
  • Develop and maintain automation for deployment, scaling, and recovery procedures
  • Run and maintain our infrastructure with cookbooks using Terraform, GitLab CI/CD, and Kubernetes
  • Respond to on-call incidents

Required Skills & Experience
  1. 4+ years of experience in SRE, DevOps, or similar roles
  2. Ability to work in a variety of languages and tools: Shell, Python, Chef (recipes, cookbooks), and Ansible (basic syntax, tasks, playbooks)
  3. Strong experience with AWS services: Cognito, EC2, EKS, RDS, CloudWatch, etc.
  4. Proficient in Kubernetes administration and operations in production environments
  5. Experience with infrastructure as code using tools like Terraform or CloudFormation
  6. Strong scripting skills with Python, Bash, or similar languages
  7. Deep understanding of observability tools such as Prometheus, Grafana, ELK stack, and distributed tracing systems
  8. Provisioning and setup of metrics and alerts in Prometheus and Grafana; provisioning and setup of logs and queries for general questions
  9. Experience with PostgreSQL or similar database systems, including replication strategies
  10. Knowledge of network protocols, load balancing, and security best practices
  11. Experience with CI/CD pipelines and GitOps workflows
  12. Ability to manage and prioritize multiple incidents under pressure
  13. Exposure to observability solutions like Splunk, Datadog, Dynatrace

Preferred Qualifications

  • AWS Certified Solutions Architect or DevOps Engineer certification
  • Certified Kubernetes Administrator (CKA) certification

Job Type: Full-time

Pay: Up to ₹1,300,000.00 per year

Experience:

  • SRE: 4 years (Preferred)
  • Terraform: 1 year (Preferred)
  • AWS: 1 year (Preferred)

Work Location: In person

Wits Innovation Lab

Technology/Innovation

Johannesburg
