Site Reliability Engineer

1 years

12 - 20 Lacs

Posted:7 hours ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

This role is for one of the Weekday's clients

Salary range: Rs 1200000 - Rs 2000000 (ie INR 12-20 LPA)

Min Experience: 1 yearsLocation: BengaluruJobType: full-timeAs an SRE, you will work closely with product engineering, DevOps, and platform teams to build resilient services, improve deployment processes, and drive operational excellence across the organization. You will be responsible for maintaining the health of our applications and infrastructure, strengthening reliability practices, and ensuring optimal system performance.

Requirements

Key Responsibilities

  • Reliability & Performance
  • Ensure high availability, resilience, and performance of production systems.
  • Conduct root cause analysis, implement long-term fixes, and reduce recurring incidents.
  • Develop and tune SLIs, SLOs, and error budgets in collaboration with engineering teams.
  • Infrastructure & Operations
  • Build, maintain, and optimize cloud infrastructure (AWS/Azure/GCP).
  • Implement Infrastructure-as-Code (IaC) using tools like Terraform, CloudFormation, or similar.
  • Manage compute, storage, networking, load balancers, and container orchestration systems.
  • Maintain CI/CD pipelines to streamline deployments and operational workflows.
  • Automation & Tooling
  • Automate operational tasks, scaling, failover, monitoring, and configuration management.
  • Develop tooling to improve engineering efficiency and reduce manual interventions.
  • Implement proactive alerting, self-healing mechanisms, and automated remediation workflows.
  • Observability & Incident Management
  • Build end-to-end observability using logs, metrics, traces, and dashboards.
  • Respond to production issues, participate in on-call rotations, and manage incident lifecycle.
  • Establish and improve incident response processes, runbooks, and reliability best practices.
  • Collaboration & Continuous Improvement
  • Partner with developers to design reliable architectures and production-ready solutions.
  • Promote SRE principles, reliability mindset, and performance culture across teams.
  • Contribute to capacity planning, cost optimization, and system scalability initiatives.

Required Skills & Qualifications

  • 1-4 years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
  • Strong understanding of SRE fundamentals, including SLIs, SLOs, error budgets, and operational maturity models.
  • Hands-on experience with cloud platforms (AWS/Azure/GCP) and distributed systems.
  • Proficiency in Linux systems, OS internals, networking concepts, and performance troubleshooting.
  • Experience with Infrastructure-as-Code tools such as Terraform, CloudFormation, or Pulumi.
  • Familiarity with containerization (Docker) and orchestration (Kubernetes).
  • Good understanding of CI/CD pipelines, Git workflows, and release engineering.
  • Exposure to monitoring and observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
  • Scripting proficiency in Bash, Python, or Go (preferred).
  • Strong analytical, problem-solving, and incident-management skills

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

bangalore urban, karnataka, india

hyderabad, telangana, india

kolkata, hyderabad, chennai, bengaluru

hyderabad, chennai, bengaluru