Site Reliability Engineer

2 - 6 years

0 Lacs

Posted:14 hours ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Site Reliability Engineer (SRE) at Logile, you will play a crucial role in ensuring the reliability, scalability, and performance of the infrastructure and applications. Your responsibilities will include: - Designing, implementing, and managing observability systems such as Prometheus, Grafana, ELK/EFK, Jaeger, Open Telemetry - Defining and maintaining SLAs, SLOs, and SLIs for services to meet reliability goals - Building automation for infrastructure, monitoring, scaling, and incident response using tools like Terraform, Ansible, and scripting (Python/Bash) - Collaborating with developers to design resilient and scalable systems following SRE best practices - Leading incident management including monitoring alerts, root cause analysis, postmortems, and continuous improvement - Implementing chaos engineering and fault-tolerance testing to validate system resilience - Driving capacity planning, performance tuning, and cost optimization across environments - Ensuring security, compliance, and governance in infrastructure monitoring You are required to have: - 2-5 years of strong experience with monitoring, logging, and tracing tools (Prometheus, Grafana, ELK, EFK, Jaeger, Open Telemetry, Loki) - Cloud expertise in AWS, Azure, or GCP monitoring and reliability practices (CloudWatch, Azure Monitor) - Proficiency in Linux system administration and networking fundamentals - Solid skills in infrastructure automation using Terraform, Ansible, and Helm - Programming/scripting skills in Python, Go, and Bash - Experience with Kubernetes and containerized workloads - Proven track record in CI/CD and DevOps practices Preferred skills include: - Experience with chaos engineering tools such as Gremlin and Litmus - Strong collaboration skills to drive SRE culture across Dev & Ops teams - Experience in Agile/Scrum environments - Knowledge of security best practices (DevSecOps) Additionally, Logile offers a competitive compensation and benefits package benchmarked against the industry standards. The standard shift for this role is from 1 PM to 10 PM, with shift allowances applicable for non-standard shifts and as per the role. Employees on shifts starting after 4 PM are eligible for food allowance/subsidized meals and cab drop, while those starting after 8 PM are also eligible for cab pickup. For more information about Logile, visit www.logile.com.,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

hyderabad, telangana, india

bengaluru, karnataka, india

new delhi, hyderabad, bengaluru