123 Site Reliability Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 7.0 years

3 - 7 Lacs

pune

Remote

We are seeking a Grafana Implementation Expert with deep expertise in Grafana and Prometheus, focusing on core development and customization rather than SRE or DevOps responsibilities. This role requires a specialist in monitoring tools, responsible for designing, developing, and optimizing Grafana dashboards, plugins, and data sources to provide real-time observability and analytics. Key Responsibilities : - Develop, customize, and optimize Grafana dashboards with advanced visualizations, queries, and alerting mechanisms.- Integrate Grafana with Prometheus and other data sources (i.e. Loki, InfluxDB, Elasticsearch, MySQL, PostgreSQL, OpenTelemetry).- Extend Grafana capabilities by developin...

Posted 2 days ago

AI Match Score
Apply

7.0 - 12.0 years

12 - 22 Lacs

gurugram, chennai, mumbai (all areas)

Hybrid

Responsibilities: Daily monitoring, incident response, and performance tuning Automation and optimization to reduce manual effort Ensuring platform reliability Supporting efforts to manage and contain increasing cloud costs Managing the rapidly growing data estate in Azure Willingness to work in on-call rotations or provide after-hours support if needed Requirements: Should have total 8+ years of experience and 4+ years of relevant experience. Knowledge of SDLC process like requirement gathering, design, implementation (Coding), testing, deployment, and maintenance. Proficient in requirement gathering and documentation Excellent communication skills verbal and written. Azure Platform Operati...

Posted 1 week ago

AI Match Score
Apply

7.0 - 11.0 years

10 - 18 Lacs

hyderabad, pune, bengaluru

Work from Office

We have an urgent need of strong Python + SRE engineer at Offshore. Kindly share profiles with hands-on experience in AWS, Kubernetes, Python, Splunk, Prometheus & Grafana. Please do share only immediate joiners and mention the candidate availability for this opportunity while sharing the profiles. Key Responsibilities Design, implement, and manage scalable and highly available cloud infrastructure on AWS or GCP. Containerize applications using Docker, and manage orchestration with Kubernetes. Collaborate with developers and QA teams to integrate CI/CD pipelines and automate deployment processes. Ensure system reliability, uptime, and performance by leveraging industry-leading monitoring too...

Posted 2 weeks ago

AI Match Score
Apply

10.0 - 14.0 years

0 Lacs

noida, uttar pradesh

On-site

Role Overview: As an Advisor, Systems Engineering at Fiserv, you will be a key member of the Open Systems Programming team responsible for ensuring the smooth operation of various DNA Data Center products. Your role will involve installing, upgrading, configuring, monitoring, troubleshooting, and maintaining the security compliance of these products. Additionally, you will play a crucial role during Go Live/Merger events, provide on-call support, and collaborate with other teams. Key Responsibilities: - Install, upgrade, and configure DNA Connect, Notifi, tMagic, and MPI products for DNA Data Center clients. - Participate in Go Live/Merger events and assist with Programming team tasks. - Mon...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

7 - 11 Lacs

lucknow

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

kanpur

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

nagpur

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

bengaluru

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

jaipur

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

noida

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

hyderabad

Remote

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

chennai

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

ahmedabad

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

mumbai

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

7 - 11 Lacs

kolkata

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

pune

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

gurugram

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 10.0 years

8 - 12 Lacs

surat

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools : logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mu...

Posted 3 weeks ago

AI Match Score
Apply

4.0 - 5.0 years

8 - 11 Lacs

gurugram

Work from Office

Position Overview : We are seeking an SRE to join our high-impact platform engineering team. You will maintain SLAs for real-time services deployed across hybrid clouds and Kubernetes clusters, contributing to automation, observability, and availability goals. Roles and Responsibilities : - Monitor application and infrastructure metrics; build dashboards and alerts (Prometheus, Grafana, ELK). - Automate health checks, incident remediation, and reliability guardrails. - Manage on-call rotations, conduct root cause analysis, and implement postmortem action plans. - Define and track SLOs, SLIs, and error budgets. - Use chaos engineering and resilience testing to ensure fault tolerance. Must Hav...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 7.0 years

17 - 22 Lacs

hyderabad

Work from Office

The ideal candidate is a Senior Site Reliability Engineer with strong expertise in CI/CD pipeline design, infrastructure automation, and backend service development. They have hands-on experience with Node.js, Python scripting, and managing large-scale Kubernetes clusters. The candidate is well-versed in AWS cloud infrastructure, including AWS CDK, and has a deep understanding of DevOps and security best practices. Familiarity with ArgoCD, Kustomize, and GitOps workflows is a strong advantage. They should also be capable of monitoring and optimizing system performance, ensuring reliability and scalability across environments, and collaborating with cross-functional teams. Responsibilities : ...

Posted 3 weeks ago

AI Match Score
Apply

6.0 - 9.0 years

12 - 16 Lacs

pune

Work from Office

We are on the lookout for a hands-on DevOps / SRE expert who thrives in a dynamic, cloud-native environment! Join a high-impact project where your infrastructure and reliability skills will shine. Key Responsibilities : - Design & implement resilient deployment strategies (Blue-Green, Canary, GitOps) - Manage observability tools: logs, metrics, traces, and alerts - Tune backend services & GKE workloads (Node.js, Django, Go, Java) - Build & manage Terraform infra (VPC, CloudSQL, Pub/Sub, Secrets) - Lead incident responses & perform root cause analyses - Standardize secrets, tagging & infra consistency across environments - Enhance CI/CD pipelines & collaborate on better rollout strategies Mus...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 7.0 years

14 - 18 Lacs

bengaluru

Work from Office

The ideal candidate is a Senior Site Reliability Engineer with strong expertise in CI/CD pipeline design, infrastructure automation, and backend service development. They have hands-on experience with Node.js, Python scripting, and managing large-scale Kubernetes clusters. The candidate is well-versed in AWS cloud infrastructure, including AWS CDK, and has a deep understanding of DevOps and security best practices. Familiarity with ArgoCD, Kustomize, and GitOps workflows is a strong advantage. They should also be capable of monitoring and optimizing system performance, ensuring reliability and scalability across environments, and collaborating with cross-functional teams. Responsibilities : ...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

haryana

On-site

As an Incident Response Analyst at Cosm, you will play a crucial role in monitoring and ensuring the performance of Cosm's infrastructure to maintain Site Reliability for Live Entertainment Venues and Live Broadcasts. Your responsibilities will include independently monitoring and managing technical operations, leading the diagnosis and resolution of critical incidents, serving as a primary point of contact for high-impact incidents, collaborating with engineering for incident remediations, generating regular incident reports, coordinating upgrades and planned activities, providing mentorship to team members, contributing to process enhancements, working closely with field services teams, co...

Posted 4 weeks ago

AI Match Score
Apply

10.0 - 12.0 years

0 Lacs

pune, maharashtra, india

On-site

Role Responsible for planning, designing and implementing technical architectural and engineering DevOps solutions for AWS hosted products along with managing infrastructure (including its security) effectively. What will your job look like Design, architect, and deliver DevOps solutions with high quality & on-time. Design, Deployment and Operation of web-based solutions and support tools, enabling development team for one click deployment. Adhere to infrastructure/platform security and security audits/compliances & address the observations quickly. Monitor infrastructure including proactive capacity & cost management and replication strategies. Implement AWS automation through scripting (co...

Posted 1 month ago

AI Match Score
Apply

5.0 - 9.0 years

6 - 9 Lacs

chennai

Work from Office

Mid-Level SRE/DevOps Engineer (C2H) | Onsite - Coimbatore Azure DevOps Automate infra with Terraform (IaC) Monitor & optimize systems using Datadog, Prometheus, Grafan Position: Mid-Level SRE/DevOps Engineer Experience: 5-6 Years Openings: 3 Location: Coimbatore (Onsite) Engagement Type: Contract-to-Hire (C2H) Contract Duration: 6 months to 1 year (based on experience & skillset) Shifts: 6.00 a.m 2.00 p.m 2.00 p.m 10.00 p.m About the Role We are looking for passionate Site Reliability Engineers (SRE) / DevOps Engineers with strong expertise in cloud platforms, infrastructure automation, container orchestration, monitoring, and disaster recovery. This is an onsite role in Coimbatore, requirin...

Posted 1 month ago

AI Match Score
Apply
Page 1 of 5
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies