Posted:2 weeks ago|
Platform:
Work from Office
Full Time
Senior SE I / Site Reliability Engineer (SRE)
We are seeking an accomplished Senior Site Reliability Engineer (SRE) with 12-15 years of experience to lead the reliability, scalability, and performance engineering of our critical infrastructure and production systems. As a Senior SRE, you will play a strategic and technical leadership role driving reliability practices, mentoring SRE teams, and influencing the adoption of automation, observability, and resilience engineering across the organization.
You will act as a technical thought leader and hands-on engineer , collaborating with infrastructure, application, and operations teams to build, automate, and scale reliable systems that support global business operations. This role requires deep expertise in cloud platforms, automation, monitoring, incident management, and system design for large-scale distributed environments.
Architect, implement, and manage resilient, scalable, and highly available infrastructure systems.
Lead initiatives to automate manual operations, deployment, and monitoring processes to improve reliability and reduce toil.
Drive the creation of observability solutions and dashboards to proactively detect and remediate potential issues.
Lead critical incident response, ensuring swift mitigation and clear communication to stakeholders.
Conduct detailed root cause analysis (RCA) and drive permanent corrective actions to prevent recurrence.
Implement and mature incident management frameworks, including runbooks, playbooks, and post-incident reviews.
Oversee system performance, capacity planning, and scalability of infrastructure across hybrid and cloud environments (AWS, Azure, GCP).
Optimize system resource utilization, latency, and reliability through performance tuning and automation.
Work closely with architecture and platform teams to accommodate growth, change, and modernization initiatives.
Provide technical leadership and mentorship to SRE teams and cross-functional engineering groups.
Promote an SRE culture across teams championing principles of reliability, automation, observability, and continuous improvement.
Drive collaboration between development, QA, DevOps, and release teams to embed reliability into the software development lifecycle (SDLC).
Define, track, and continuously improve Service Level Objectives (SLOs) and Service Level Indicators (SLIs) .
Electronic Arts
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
hyderabad, telangana, india
Salary: Not disclosed
hyderabad
15.0 - 19.0 Lacs P.A.
hyderabad, telangana, india
Salary: Not disclosed
hyderābād
8.0 - 10.0 Lacs P.A.
hyderabad, telangana
Experience: Not specified
Salary: Not disclosed
bengaluru
4.0 - 8.0 Lacs P.A.
chennai
5.0 - 9.0 Lacs P.A.
bengaluru
4.0 - 8.0 Lacs P.A.
bengaluru
4.0 - 8.0 Lacs P.A.
17.0 - 22.5 Lacs P.A.