Site Reliability Engineer

7 - 12 years

20.0 - 32.5 Lacs P.A.

Chennai, Gurgaon, Mumbai (All Areas)

Posted:2 months ago| Platform: Naukri logo

Apply Now

Skills Required

AzureAks KubernetesLinuxTerraformPythonCi Cd PipelineBash ScriptingShell ScriptingAksJenkinsDockerAnsibleSite Reliability EngineeringKubernetes

Work Mode

Hybrid

Job Type

Full Time

Job Description

Job Description: We are looking for a seasoned Site Reliability Engineer with a strong background in Azure and Python. Youll manage and deploy critical AKS clusters, ensure the availability and scalability of systems, and automate key operations. Your work will focus on maintaining high uptime, optimizing infrastructure, and integrating observability platforms for proactive monitoring. Responsibilities: Deploy and manage AKS clusters on Azure, including base image updates and IaC testing. Develop and automate processes using Python, Terraform, and Helm to maintain a scalable production environment. Conduct disaster recovery testing, analyze logs, and respond to incidents to ensure minimal downtime. Focus on achieving five-9's availability targets and managing the operations of large-scale systems. Collaborate with cross-functional teams to design, monitor, and optimize infrastructure and applications. Skills: Strong experience in Python programming and automation scripting. Expertise in Kubernetes, Terraform, Helm, ArgoCD, and CI/CD tools like GitHub Actions. In-depth knowledge of cloud networking, Linux systems, and observability tools (ELK, Grafana Loki). Solid experience with Infrastructure as Code (IaC) and monitoring/logging platforms like OpenTelemetry. Education/Experience: Bachelor’s degree in Computer Science, IT, or related field. 7+ years of experience in site reliability engineering, with at least 3 years directly managing AKS or similar cloud environments. Prior experience working in regulated environments is a plus. Mandatory Skills: Strong expertise with Azure, Terraform, Kubernetes with Helm, and GitOps (ArgoCD). Python or Go programming, ELK/Grafana for monitoring/logging. Proficiency in Linux and cloud networking.

RecommendedJobs for You

Chennai, Pune, Delhi, Mumbai, Bengaluru, Hyderabad, Kolkata

Pune, Bengaluru, Mumbai (All Areas)

Chennai, Pune, Delhi, Mumbai, Bengaluru, Hyderabad, Kolkata

Bengaluru, Hyderabad, Mumbai (All Areas)

Hyderabad, Gurgaon, Mumbai (All Areas)