125 Site Reliability Jobs - Page 5

JobPe aggregates job listings for easy access; you apply directly on the original job portal.

6.0 - 8.0 years

13 - 18 Lacs

Gurugram

Work from Office

Responsibilities: - Define and enforce SLOs, SLIs, and error budgets across microservices - Architect an observability stack (metrics, logs, traces) and drive operational insights - Automate toil and manual ops with robust tooling and runbooks - Own incident response lifecycle: detection, triage, RCA, and postmortems - Collaborate with product teams to build fault-tolerant systems - Champion performance tuning, capacity planning, and scalability testing - Optimise costs while maintaining the reliability of cloud infrastructure. Must-Have Skills: - 6+ years in SRE/Infrastructure/Backend related roles using Cloud Native Technologies - 2+ years in SRE-specific capacity - Strong experience with...

Posted 4 months ago


5.0 - 8.0 years

4 - 7 Lacs

Bengaluru

Work from Office

Key Responsibilities: Building software applications - Responsible for building software applications using relevant development languages and knowledge of the systems, services and tools appropriate for the business area, and for guiding more junior team members in this topic. - Responsible for refactoring and simplifying code by introducing design patterns when necessary, and for guiding more junior team members in this topic. - Responsible for ensuring the quality of the application by following standard testing techniques and methods that adhere to the test strategy. - Responsible for writing readable and reusable code by applying standard patterns and using standard libraries. - Is responsible to mai...

Posted 5 months ago


7.0 - 12.0 years

18 - 22 Lacs

Pune

Work from Office

We are looking for a highly skilled Site Reliability Engineer (SRE) with strong engineering and architectural expertise to design, implement, and manage large-scale, mission-critical infrastructure across multiple data centers and cloud providers. As an SRE, you will be responsible for architecting and optimizing our global infrastructure, enabling development teams to roll out new features efficiently while maintaining high availability and reliability. You will be hands-on with automation, performance tuning, infrastructure scalability, and cloud-native technologies to ensure a seamless user experience for millions of customers. Key Responsibilities : 1. Architect and implement highly scal...

Posted 5 months ago


5.0 - 8.0 years

13 - 17 Lacs

Gurugram

Work from Office

POSITION SUMMARY: In this role, you will play a crucial part in shaping the firm's infrastructure reliability and efficiency by implementing robust Site Reliability Engineering practices. Your contribution will be pivotal in ensuring the availability, scalability, and performance of our systems and applications. Leveraging your strong technical skills and expertise in DevOps principles, you will work towards enhancing the reliability of our infrastructure and minimizing downtime, thus enabling the organization to deliver high-quality software with maximum efficiency. EXPERIENCE AND REQUIRED SKILL SETS: - Ensure 24/7 uptime and stability of production systems - Investigate and troubleshoot p...

Posted 5 months ago


5.0 - 7.0 years

3 - 7 Lacs

Pune

Remote

We are seeking a Grafana Implementation Expert with deep expertise in Grafana and Prometheus, focusing on core development and customization rather than SRE or DevOps responsibilities. This role requires a specialist in monitoring tools, responsible for designing, developing, and optimizing Grafana dashboards, plugins, and data sources to provide real-time observability and analytics. Key Responsibilities: - Develop, customize, and optimize Grafana dashboards with advanced visualizations, queries, and alerting mechanisms. - Integrate Grafana with Prometheus and other data sources (e.g. Loki, InfluxDB, Elasticsearch, MySQL, PostgreSQL, OpenTelemetry). - Extend Grafana capabilities by developin...

Posted 5 months ago


3.0 - 8.0 years

16 - 20 Lacs

Mumbai

Work from Office

What will you do at Fynd? - Run the production environment by monitoring availability and taking a holistic view of system health. - Improve reliability, quality, and time-to-market of our suite of software solutions - Be the 1st person to report the incident. - Debug production issues across services and levels of the stack. - Envisioning the overall solution for defined functional and non-functional requirements, and being able to define technologies, patterns and frameworks to realise it. - Building automated tools in Python / Java / GoLang / Ruby etc. - Help Platform and Engineering teams gain visibility into our infrastructure. - Lead design of software components and systems, to ensure...

Posted 5 months ago


6.0 - 10.0 years

13 - 17 Lacs

Hyderabad

Remote

Mode of Interview: 2-3 rounds (Virtual/In-person). Notice: Immediate - 15 days max. Technical Skill Requirements: ServiceNow Business Analyst, ITIL, ITSM, Dashboard Creation, APM, Scripting, Datadog. Role and Responsibilities: - 6+ years of experience as an SRE, with thorough knowledge of ITIL/ITSM processes - Certification in the ITIL v4 framework and deep knowledge of ITSM platforms preferable - Hands-on experience with the APM tool Datadog - Demonstrable ability to implement complex process workflows, and evidence performance through metrics-driven reporting - Strong understanding of IT Operations - Strong written and verbal communication skills with the ability to understand and present c...

Posted 5 months ago


8.0 - 13.0 years

15 - 25 Lacs

Hyderabad

Work from Office

Greetings from AIS! AIS (Applied Information Sciences) is a highly regarded software and systems engineering firm that has provided professional application development services to commercial and government clients since 1982. One of Microsoft's oldest and largest Managed Gold partners in the U.S., AIS is exclusively focused on building enterprise-class custom applications using Microsoft technologies. As we continue to experience extraordinary growth, we are seeking professionals to join our AIS Team in India. For more information, please visit: http://www.ais.com https://www.ais.com/blog/ Job Summary: Role: Site Reliability Engineer. Mode of Hire: Full-time / Contract opportunity. Responsibilities T...

Posted 5 months ago


9.0 - 14.0 years

20 - 35 Lacs

Bengaluru

Work from Office

Lead automation and expense management initiatives across global network platforms. Ensure secure, cost-effective operations, enhance reliability via SRE practices, and oversee vendor TEM performance, reporting, and billing accuracy. Required Candidate Profile: Experience in network automation, CI/CD, and cost governance. Skilled in SRE, telecom expense management, circuit cleanup, vendor coordination, and performance reporting using Power BI and Microsoft 365.

Posted 5 months ago


10.0 - 18.0 years

30 - 45 Lacs

Bengaluru

Work from Office

Lead and support RF, Voice/IPT, telephony, and mobile infrastructure globally. Drive innovation, reliability, and automation across network platforms, ensuring secure, scalable, and high-performance communication systems. Required Candidate Profile: Experienced in RF design, VoIP/IPT systems, UC tools, wireless/mobility, and SRE practices. Skilled in Tier-3 support, automation, and vendor management.

Posted 5 months ago


5 - 10 years

7 - 12 Lacs

Bengaluru

Work from Office

Engineering Manager - Site Reliability: The role of the Engineering Manager - Site Reliability is primarily to manage, mentor and develop a team of Site Reliability Engineers, ensuring that the development of both the individuals and the team as a whole is in line with organizational objectives and direction. Manages all activities in scope through the direction of activities, to design new products and modify existing designs, ensuring that deliverables are on time and with acceptable quality. The role holder is required to analyze technology trends, human resource needs, and market demand to plan projects to ensure resilience in line with current demand and future ambition. In addition to this, the ...

Posted 5 months ago


3 - 8 years

19 - 22 Lacs

Kolkata, Hyderabad, Pune

Work from Office

Experienced in .NET (3–5 yrs), DevOps/SRE (3+ yrs), CI/CD, Git, IaC, Agile, cloud-native apps, observability, KQL/SQL, and cross-functional DevOps solutions in production environments. Mail: kowsalya.k@srsinfoway.com

Posted 5 months ago


5 - 9 years

22 - 27 Lacs

Pune, Chennai, Bengaluru

Hybrid

Hiring for the position below (immediate joiners or 15 days' notice). Job Title: Senior .Net Developer. Experience: 5 - 9 years. Job Location: Pan India (Hybrid). Key Requirements: - Proficiency in writing production code with an industry-standard programming language using Agile methodologies - Proficiency practicing Infrastructure as Code and Configuration as Code techniques - Proficiency managing multiple code bases in Git - Proficiency creating Continuous Integration builds and deployment automation, for example CI/CD pipelines - Proficiency building Cloud Native applications in a major public cloud - Proficiency implementing observability, application monitoring, and log aggregation solutions - Proficiency working w...

Posted 5 months ago


5.0 - 10.0 years

8 - 12 Lacs

Surat

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


5.0 - 10.0 years

8 - 12 Lacs

Chennai

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


5.0 - 10.0 years

8 - 12 Lacs

Kolkata

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


5.0 - 10.0 years

7 - 11 Lacs

Jaipur

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


5.0 - 10.0 years

7 - 11 Lacs

Bengaluru

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


4.0 - 5.0 years

8 - 11 Lacs

Gurugram

Work from Office

Position Overview : We are seeking an SRE to join our high-impact platform engineering team. You will maintain SLAs for real-time services deployed across hybrid clouds and Kubernetes clusters, contributing to automation, observability, and availability goals. Roles and Responsibilities : - Monitor application and infrastructure metrics; build dashboards and alerts (Prometheus, Grafana, ELK). - Automate health checks, incident remediation, and reliability guardrails. - Manage on-call rotations, conduct root cause analysis, and implement postmortem action plans. - Define and track SLOs, SLIs, and error budgets. - Use chaos engineering and resilience testing to ensure fault tolerance. Must Hav...

Posted Date not available


5.0 - 10.0 years

8 - 12 Lacs

Pune

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


5.0 - 10.0 years

8 - 12 Lacs

Gurugram

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


5.0 - 10.0 years

8 - 12 Lacs

Mumbai

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


5.0 - 10.0 years

8 - 12 Lacs

Ahmedabad

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


5.0 - 10.0 years

8 - 12 Lacs

Hyderabad

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted Date not available


6.0 - 11.0 years

6 - 16 Lacs

Pune, Thiruvananthapuram

Hybrid

Automation and Optimization: Develop automation scripts and tools to streamline IAM operations, including provisioning, de-provisioning, and access management. Optimize system configurations and processes to improve efficiency and reduce manual intervention. Incident Management and Response: Lead incident response efforts for IAM-related issues, including root cause analysis and resolution. Implement strategies to minimize downtime and ensure rapid recovery in the event of system failures. Collaboration and Communication: Work closely with development, operations, and security teams to ensure seamless integration and operation of IAM solutions. Communicate effectively with stakeholders regar...

Posted Date not available
