Sr. Site Reliability Engineer

7 - 12 years

9.0 - 14.0 Lacs P.A.

Hyderabad

Posted:2 months ago| Platform: Naukri logo

Apply Now

Skills Required

Site Reliability EngineeringRDSS3DevOpsAutomationIAMELKEC2CI/CDVPCCloud EngineeringGrafana

Work Mode

Work from Office

Job Type

Full Time

Job Description

As an SRE , you will work with AWS, Kubernetes, Jenkins, and GitLab CI/CD to drive automation, monitoring, incident response, and performance improvements . You will contribute to both operational excellence and strategic initiatives in cloud reliability and security. WHAT THE ROLE OFFERS: Cloud Infrastructure & Reliability Engineering Ensure 99.99% uptime and reliability of TDR services through proactive monitoring and optimizations. Architect, deploy, and manage AWS cloud environments including EC2, S3, RDS, EKS, IAM, Lambda, and CloudFormation. Manage and optimize Kubernetes (EKS) clusters and containerized applications using Docker and Helm. Improve Infrastructure as Code (IaC) using Terraform, Ansible, or CloudFormation to automate cloud deployments. CI/CD & Automation Develop and maintain CI/CD pipelines in Jenkins, GitLab CI/CD, or ArgoCD for seamless software delivery. Automate infrastructure provisioning, deployments, and operational workflows . Ensure zero-downtime deployments and efficient release management. On-Call Responsibilities & Incident Management Participate in a 24/7 on-call rotation , ensuring rapid response to incidents. Investigate, diagnose, and resolve production incidents while minimizing downtime (MTTR). Conduct blameless postmortems and implement fixes to prevent future incidents. Improve SLI/SLO monitoring , alerting mechanisms, and automate incident remediation. Monitoring & Performance Optimization Implement and optimize monitoring, logging, and alerting using CloudWatch, Prometheus, Grafana, ELK, or Datadog. Enhance observability to detect anomalies and improve system performance. Optimize infrastructure costs and implement auto-scaling strategies for efficient resource utilization. Security & Compliance Ensure security best practices, including IAM policies, encryption, and network security . Automate security compliance (SOC2, ISO27001, HIPAA) and vulnerability management. Regularly patch, audit, and secure cloud environments . Collaboration & Leadership Work cross-functionally with DevOps, security, and development teams to drive reliability best practices. Mentor and coach junior engineers on SRE principles, automation, and cloud reliability . Contribute to team growth by improving operational workflows, documentation, and training . Key KPIs Contributed by this Role Uptime & Reliability: Maintain high availability (99.99%) of TDR services. Incident Resolution: Reduce MTTR (Mean Time to Resolution) through automation and improved response times. Automation & Efficiency: Enhance operational efficiency by implementing self-healing, auto-scaling, and auto-remediation . Cost Optimization: Optimize cloud spending through scalable, right-sized infrastructure . Deployment Success: Support seamless infrastructure and CI/CD-driven production deployments . WHAT YOU NEED TO SUCCEED: 7-12 years of experience in Site Reliability Engineering (SRE), DevOps, or Cloud Engineering . Expertise in AWS Cloud Hands-on experience with EC2, VPC, RDS, S3, IAM, Lambda, and EKS. Kubernetes & Containers Experience managing EKS, Helm charts, Docker, and container orchestration. CI/CD & Automation Proficiency in Jenkins, GitLab CI/CD, or ArgoCD for deployment automation. Infrastructure as Code (IaC) Strong knowledge of Terraform, Ansible, or CloudFormation. Monitoring & Logging Familiarity with CloudWatch, Prometheus, Grafana, ELK, or Datadog. Scripting & Automation Experience in Python, Shell scripting, or Golang. Incident Management & Reliability Best Practices Strong understanding of SLOs, SLIs, error budgets, and chaos engineering

Software Development
Waterloo ON +45

RecommendedJobs for You

Chennai, Pune, Delhi, Mumbai, Bengaluru, Hyderabad, Kolkata

Pune, Bengaluru, Mumbai (All Areas)

Chennai, Pune, Delhi, Mumbai, Bengaluru, Hyderabad, Kolkata

Bengaluru, Hyderabad, Mumbai (All Areas)

Hyderabad, Gurgaon, Mumbai (All Areas)