27 Slo Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

4.0 - 9.0 years

20 - 30 Lacs

mumbai, mumbai suburban, mumbai (all areas)

Hybrid

ISS STOXX is looking for a Senior Site Reliability Engineer to join our team in Mumbai (Goregaon East), India. Shift hours: Working hours (10 AM IST to 7 PM IST). This role expects rotational on-call support 24X7. Overview: This role is critical in ensuring the reliability, scalability and performance of our systems and services. As an Senior SRE, you will work at the intersection of software engineering and infrastructure, and driving continuous improvements in availability and performance. This is a high impact role ideal for individual who thrive in fast paced environments and enjoy solving complex technical challenges. Responsibilities: Assist the Principal SRE in driving the architectur...

Posted 1 day ago

AI Match Score
Apply

10.0 - 16.0 years

0 Lacs

pune

Work from Office

Role & responsibilities Experience as SRE engineer in technology organization driving projects by defining and creation of CUJ, SLO, SLI, Error Budgeting. Strong experience in setting up observability, monitoring and self-healing solutions for instance with Grafana, Google Cloud Operations and Ansible / Terraform or comparable tools. Strong Knowledge on IAAC Terraform, GitHub, Docker Images Good understanding of SCM Tools Git, GitHub, SonarQube, Snyk Strong Hands on in scripting like Bash , PowerShell, Python , Ansible String Understanding of CI/CD pipelines, creation and maintenance, e.g., Cloud Build , Github, and GitHub Actions, Docker. Proactive attitude and collaborative Team player min...

Posted 6 days ago

AI Match Score
Apply

6.0 - 11.0 years

15 - 25 Lacs

bengaluru

Hybrid

Primary Responsibilities Site Reliability Engineering (SRE) is an engineering discipline that combines software and system engineering to build and run large scale, massively distributed, fault-tolerant systems. SREs ensure managed service offerings and customer deployments have reliability and uptime appropriate to users needs and a fast rate of improvement while monitoring and validating capacity and performance. Focused on reliability, scalability, and the development of automation to manage a set of repetitive tasks at scale. Knowledge &Skills In depth knowledge on SRE practices and concepts like SLA, SLO, SLI, Error budget, Toil elimination, Post-mortem etc. Should have experience in an...

Posted 1 week ago

AI Match Score
Apply

6.0 - 11.0 years

15 - 25 Lacs

pune

Hybrid

Primary Responsibilities Site Reliability Engineering (SRE) is an engineering discipline that combines software and system engineering to build and run large scale, massively distributed, fault-tolerant systems. SREs ensure managed service offerings and customer deployments have reliability and uptime appropriate to users needs and a fast rate of improvement while monitoring and validating capacity and performance. Focused on reliability, scalability, and the development of automation to manage a set of repetitive tasks at scale. Knowledge &Skills In depth knowledge on SRE practices and concepts like SLA, SLO, SLI, Error budget, Toil elimination, Post-mortem etc. Should have experience in an...

Posted 1 week ago

AI Match Score
Apply

10.0 - 17.0 years

14 - 24 Lacs

hyderabad

Work from Office

Role & responsibilities : - Lead and mentor backend engineers while contributing code, reviews, and architectural guidance for scalable, resilient distributed services and data platforms. - Architect and implement fault-tolerant microservices with patterns such as circuit breakers, bulkheads, idempotency, and saga/transactional outbox where appropriate. - Embed application security into SDLC: threat modelling, secure design reviews, secure coding standards, dependency hygiene, and security test automation (SAST/DAST). - Apply secure coding practices in Java/Spring: validation/encoding, parameterized queries, secrets management, method-level authorization, CSRF protections, and audit logging....

Posted 1 week ago

AI Match Score
Apply

5.0 - 10.0 years

0 Lacs

pune, aurangabad, mumbai (all areas)

Work from Office

Roles and Responsibilities Design, implement, and maintain scalable and reliable infrastructure for our SaaS product using automation tools like Ansible and Terraform. Collaborate with cross-functional teams to identify areas of improvement in production support processes and develop solutions to reduce Toil (unnecessary work) through process improvements. Develop and maintain Service Level Objectives (SLAs) for critical services, ensuring high availability, latency, and error rates meet company standards. Provide 24/7 on-call support for critical systems during off-hours periods.

Posted 1 week ago

AI Match Score
Apply

10.0 - 15.0 years

20 - 30 Lacs

hyderabad

Hybrid

About TechBlocks TechBlocks is a global digital product engineering company with 16+ years of experience helping Fortune 500 enterprises and high-growth brands accelerate innovation, modernize technology, and drive digital transformation. From cloud solutions and data engineering to experience design and platform modernization, we help businesses solve complex challenges and unlock new growth opportunities. At TechBlocks, we believe technology is only as powerful as the people behind it. We foster a culture of collaboration, creativity, and continuous learning, where big ideas turn into real impact. Whether you're building seamless digital experiences, optimizing enterprise platforms, or tac...

Posted 2 weeks ago

AI Match Score
Apply

5.0 - 15.0 years

0 Lacs

karnataka

On-site

As a DevOps SRE Engineer at LTIMindtree, your role will involve monitoring, observability, reliability, SLI, and SLO. You will be responsible for utilizing monitoring tools such as Prometheus, Grafana, Dynatrace, Splunk, New Relic, and Datadog to ensure the smooth operation and performance of the systems. Qualifications required for this role include: - Overall 5-15 years of IT experience - Minimum of 5+ years of experience as a DevOps SRE Engineer If interested, please share your updated profile to madhuvanthi.s@ltimindtree.com.,

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 12.0 years

0 Lacs

bangalore, karnataka

On-site

Role Overview: As an experienced hands-on Cloud SRE Manager at Palo Alto Networks, you will lead high-severity incident and problem management across GCP-centric platforms. Your role involves a combination of deep technical troubleshooting and process ownership to ensure rapid recovery, root cause elimination, and long-term reliability improvements. You will be responsible for L3 OnCall duties, driving post-incident learning, and advocating for automation and operational excellence. Key Responsibilities: - Implement and lead post-mortem processes within SLAs, identify root causes, and drive corrective actions to reduce repeat incidents. - Rapidly diagnose and resolve failures across Kubernet...

Posted 1 month ago

AI Match Score
Apply

0.0 years

4 Lacs

bengaluru, karnataka, india

On-site

Responsibilities : Evaluate and ensure availability of components within their teams and identify how to bring all services within SLO (99.XX) Monitor systems for implemented automation and set SLI/SLOs along with respective stakeholders. Implementation of observability platform Review all ownership data and ensure it is current and complete. Review volume and accuracy of bugs assigned to the team and identify opportunities to improve automated triage. Identify CFBT (Customer Flow Based Testing) eligible flows, develop CFBT tests and train the team on how to write and maintain them. Lead post postmortems for any P1 or greater incidents during the rotation. Train the team on distributed probl...

Posted 1 month ago

AI Match Score
Apply

10.0 - 12.0 years

0 Lacs

pune, maharashtra, india

On-site

Siemens Digital Industries Software is a leading provider of solutions for the design, simulation, and manufacture of products across many different industries. Formula 1 cars, skyscrapers, ships, space exploration vehicles, and many of the objects we see in our daily lives are being conceived and manufactured using our Product Lifecycle Management (PLM) software. We are seeking an experienced IAM Architect with deep expertise in designing and implementing Identity and Access Management (IAM) solutions at scale. The ideal candidate should have strong architectural experience in Java/Spring Bootbased microservices , along with proven expertise in security standards such as SAML, OIDC, SCIM, a...

Posted 1 month ago

AI Match Score
Apply

4.0 - 6.0 years

3 - 5 Lacs

noida, greater noida

Work from Office

Required Skills and Experience Having 5+ Years of experience in Service Level Management (SLM) Define and manage service level objectives (SLOs) and ensure alignment with CUSTOMER expectations. Monitor and evaluate service performance against agreed SLAs on a monthly basis. Identify and report service level defaults when performance metrics fall below expected thresholds. Administer service level credits (liquidated damages) for SLA breaches, calculated based on the percentage of calls outside acceptable service levels. Facilitate earn-back mechanisms when performance exceeds SLA targets, subject to CUSTOMER discretion. Ensure SLA compliance across all in-scope services including incident, c...

Posted 1 month ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

hyderabad, telangana

On-site

As an Infrastructure & Platform Modernization Lead, you will be responsible for: - Leading infrastructure automation using Infrastructure-as-Code (IaC) tools such as Terraform, Pulumi, AWS CloudFormation, etc. - Designing and maintaining scalable, secure, and compliant cloud-native infrastructure on AWS, Azure, or GCP. - Building reusable IaC modules to enforce infrastructure standards across environments. - Driving cloud platform modernization and legacy system transformation. - Developing containerization and orchestration strategies with Docker and Kubernetes. - Implementing infrastructure testing and validation in CI/CD pipelines. In the Production Operations & Application Support role, ...

Posted 1 month ago

AI Match Score
Apply

6.0 - 11.0 years

15 - 25 Lacs

pune

Hybrid

Primary Responsibilities Site Reliability Engineering (SRE) is an engineering discipline that combines software and system engineering to build and run large scale, massively distributed, fault-tolerant systems. SREs ensure managed service offerings and customer deployments have reliability and uptime appropriate to users needs and a fast rate of improvement while monitoring and validating capacity and performance. Focused on reliability, scalability, and the development of automation to manage a set of repetitive tasks at scale. Knowledge &Skills In depth knowledge on SRE practices and concepts like SLA, SLO, SLI, Error budget, Toil elimination, Post-mortem etc. Should have experience in an...

Posted 1 month ago

AI Match Score
Apply

6.0 - 11.0 years

15 - 25 Lacs

bengaluru

Hybrid

Primary Responsibilities Site Reliability Engineering (SRE) is an engineering discipline that combines software and system engineering to build and run large scale, massively distributed, fault-tolerant systems. SREs ensure managed service offerings and customer deployments have reliability and uptime appropriate to users needs and a fast rate of improvement while monitoring and validating capacity and performance. Focused on reliability, scalability, and the development of automation to manage a set of repetitive tasks at scale. Knowledge &Skills In depth knowledge on SRE practices and concepts like SLA, SLO, SLI, Error budget, Toil elimination, Post-mortem etc. Should have experience in an...

Posted 1 month ago

AI Match Score
Apply

7.0 - 11.0 years

0 Lacs

karnataka

On-site

Role Overview: As an Incident Commander at Palo Alto Networks, you will be playing a pivotal role in the cybersecurity landscape, dedicated to addressing critical incidents for customers and ensuring their satisfaction. Your proactive approach, dedication to continuous improvement, and passion for customer service will be key in solidifying the company's reputation as a cybersecurity partner of choice. Key Responsibilities: - Coordinate and lead response initiatives for the company's most critical incidents and escalations affecting customers - Demonstrate adept leadership and seamless coordination with global teams, making quick decisions and facilitating communication across diverse teams ...

Posted 1 month ago

AI Match Score
Apply

4.0 - 9.0 years

20 - 30 Lacs

mumbai, mumbai suburban, mumbai (all areas)

Hybrid

ISS STOXX is looking for a Senior Site Reliability Engineer to join our team in Mumbai (Goregaon East), India. Shift hours: Working hours (10 AM IST to 7 PM IST). This role expects rotational on-call support 24X7. Overview: This role is critical in ensuring the reliability, scalability and performance of our systems and services. As an Senior SRE, you will work at the intersection of software engineering and infrastructure, and driving continuous improvements in availability and performance. This is a high impact role ideal for individual who thrive in fast paced environments and enjoy solving complex technical challenges. Responsibilities: Assist the Principal SRE in driving the architectur...

Posted 1 month ago

AI Match Score
Apply

5.0 - 7.0 years

15 - 20 Lacs

bengaluru

Hybrid

Job Title: Site Reliability Engineer (SRE) Experience Range: 5-7 Years Location: Bangalore / Hybrid Employment Type: Full-time About the Role We are seeking a skilled Site Reliability Engineer (SRE) to design, build, and maintain highly available, scalable, and reliable systems. The ideal candidate will have a strong background in infrastructure management, DevOps practices, cloud platforms, and observability tools . You will collaborate with cross-functional teams to ensure system stability, performance, and security while driving automation and continuous improvement initiatives. Key Responsibilities Software Development & Automation Develop, test, and maintain high-quality software soluti...

Posted 1 month ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

karnataka

On-site

You will be responsible for the following in this role: - Operating, monitoring, and triaging all aspects of production and non-production environments. - Automating deployment and orchestration of services into the cloud environment as well as other routine processes. - Working on multiple cloud environments like AWS and GCP. - Actively participating in capacity planning, scale testing, and disaster recovery exercises. - Interacting with and supporting partner teams, including Engineering, QA, and program management. - Troubleshooting customer concerns for ML Tuning and inference endpoints on Ray. - Designing and implementing RESTful/RPC API and services using Golang OR Python. - Implementi...

Posted 1 month ago

AI Match Score
Apply

3.0 - 7.0 years

0 Lacs

chennai, tamil nadu

On-site

As a member of our team, you will be responsible for the following key tasks: You must possess a strong understanding of Dynatrace and AWS, utilizing this knowledge to effectively manage the production environment by continuously monitoring availability and maintaining a comprehensive view of system health. Demonstrate familiarity with SLA, SLO, and SLI key metrics to ensure adherence to performance standards. Collect and analyze metrics from applications to support performance optimization and troubleshooting efforts. Proficiency in AWS services including API gateway, Lambda, Kibana, Cloudwatch, Dynamo DB, and S3 is necessary. Implement automation practices to develop sustainable systems an...

Posted 2 months ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

Bengaluru, Karnataka, India

On-site

Who We Are: Wayfair runs the largest custom e-commerce large parcel network in the United States, approximately 1.6 million square meters of logistics space. The nature of the network is inherently a highly variable ecosystem that requires flexible, reliable, and resilient systems to operate efficiently. The Wayfair Operations Center team is looking for experienced engineers skilled in cloud-native design, legacy maintenance, and SRE best practicesplus ideas for improvement! We collaborate across our Technology organization to ensure platforms and services are production-ready, contributing to both platform and software codebases. What Youll Do: As a DevOps Engineer, you will join our team t...

Posted 3 months ago

AI Match Score
Apply

8.0 - 12.0 years

10 - 14 Lacs

Bengaluru

Work from Office

Job Summary: We are looking for a Junior Site Reliability Engineer (SRE) with strong Java coding and debugging skills to help maintain the reliability, performance, and scalability of our critical systems. As a Junior SRE, you will work closely with senior engineers to monitor systems, automate processes, and enhance infrastructure reliability. This role is ideal for candidates passionate about Java, DevOps, cloud technologies, and automation in a fast-paced environment. Experience: 2-4 years Key Responsibilities: System Reliability & Performance: Monitor and maintain the availability of key services and applications. Participate in defining and improving SLIs, SLOs, and SLAs for system reli...

Posted 3 months ago

AI Match Score
Apply

5.0 - 9.0 years

15 - 27 Lacs

Pune, Bengaluru, Mumbai (All Areas)

Hybrid

Role: ServiceNow SLO Consultant Location: Pune / Mumbai / Bangalore / Chennai Skills Required: Over 3 years of hands-on experience in ServiceNow Administration, Development, and Upgrade Support, including day-to-day 2nd and 3rd line support and implementation of core ServiceNow modules. Proven expertise in ServiceNow modules such as Incident Management, Problem Management, Change Management, and CMDB. Experience with the ServiceNow SLO (Supplier Lifecycle Operations) module is must. Strong analytical and scripting skills across Client and Server-side scripting, including: Demonstrated ability to support and manage ServiceNow platform upgrades, ensuring minimal disruption and full compatibili...

Posted 3 months ago

AI Match Score
Apply

12.0 - 18.0 years

35 - 60 Lacs

Hyderabad

Hybrid

Senior Manager, Site Reliability Engineering Hyderabad Shift Timings: 1.00 PM - 10.00 PM Duties and Responsibilities: People Leader Responsibility Position will manage 5 to 10 engineers both directly and indirectly. The engineers will include Site Reliability Engineers, Observability Engineers, Performance Engineers, DevSecOps Engineers, and others These individuals will vary from entry level to senior titles. Responsibilities: Lead and manage a team of Site Reliability Engineers, providing mentorship, guidance, and support to ensure the team's success. Develop and implement strategies for improving system reliability, scalability, and performance. Establish and enforce SRE best practices, i...

Posted 4 months ago

AI Match Score
Apply

5.0 - 8.0 years

25 - 37 Lacs

Thiruvananthapuram

Work from Office

What youll do? Design, develop, and operate high scale applications across the full engineering stack. Design, develop, test, deploy, maintain, and improve software. Apply modern software development practices (serverless computing, microservices architecture, CI/CD, infrastructure-as-code, etc.) Work across teams to integrate our systems with existing internal systems, Data Fabric, CSA Toolset. Participate in technology roadmap and architecture discussions to turn business requirements and vision into reality. Participate in a tight-knit, globally distributed engineering team. Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on networ...

Posted 4 months ago

AI Match Score
Apply
Page 1 of 2
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies