Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
0.0 years
0 Lacs
hyderabad, telangana, india
Remote
Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone. Job Description What you get to do in this role: Provide relief and sustainable resolut...
Posted 1 week ago
4.0 - 6.0 years
12 - 14 Lacs
gurugram
Work from Office
Responsibilities: * Design, implement & maintain reliable systems using Python automation. * Collaborate with cross-functional teams on SLI/SLO creation & monitoring.
Posted 1 week ago
2.0 - 4.0 years
0 Lacs
bengaluru, karnataka, india
On-site
Job title : DataDog Engineer Location : Bangalore, India Duration : 12 months Job Description : About the Role We are looking for a skilled Datadog Engineer to design, implement, and maintain monitoring and observability solutions using Datadog. This role focuses on ensuring system reliability, performance, and proactive incident detection across cloud and on-prem environments. Key Responsibilities Deploy and configure Datadog agents for infrastructure, applications, and services. Create dashboards, alerts, monitors, log pipelines, parsing rules for real-time visibility into system health. Integrate Datadog with CI/CD pipelines and automation tools. Implement APM and distributed tracing for ...
Posted 2 weeks ago
4.0 - 8.0 years
0 Lacs
pune, maharashtra, india
Remote
Job Description Job Title: Observability & Monitoring Engineer Location: India Department: Employee Services Technology & Operations (ESTO) ITSM & Service Operations Love making complex systems feel simple and reliable We're looking for an Observability & Monitoring Engineer who is equal parts builder and detectivesomeone who instruments services end-to-end, shines a light on blind spots, and turns noise into actionable signals. You'll help us evolve a modern RunOps capability that improves reliability, reduces toil, and elevates the employee experience across Zendesk. About Zendesk At Zendesk, we believe outstanding customer and employee experiences start with great service and resilient pl...
Posted 2 weeks ago
2.0 - 4.0 years
0 - 3 Lacs
gurugram
Hybrid
What You Will Be Doing: As a Site Reliability Engineer, you'll use your advanced development and operations knowledge to identify and prioritize issues. Find universal solutions to common problems and mentor and support junior staff. Additionally, you will: Design, implement, and maintain reliable cloud infrastructure using AWS services. Develop and enhance CI/CD pipelines to accelerate software delivery. Build automation for deployment, monitoring, scaling, and incident recovery. Collaborate with development teams to embed observability and reliability best practices. Identify and remediate performance bottlenecks, scaling challenges, and operational risks. Utilize AI-based developer produc...
Posted 3 weeks ago
5.0 - 9.0 years
0 Lacs
pune, maharashtra
On-site
Role Overview: As a senior SRE / Observability Engineer at PTC, you will be part of the Atlas Platform Engineering team. Your main responsibilities will include creating and maintaining observability standards and best practices, reviewing and enhancing the observability platform, designing and implementing monitoring solutions for complex distributed systems, supporting the ongoing evaluation of new capabilities, providing technical leadership to the observability team, collaborating with R&D and product development teams, conducting training sessions, and assisting in automating monitoring and alerting processes. Key Responsibilities: - Create and maintain observability standards and best ...
Posted 3 weeks ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
As an ideal candidate for the role, you will be responsible for the following key aspects: Role Overview: You will play a crucial role in Infrastructure & Platform Modernization, focusing on lead infrastructure automation using tools like Terraform, Pulumi, AWS CloudFormation, etc. Your responsibilities will include designing and maintaining scalable, secure, and compliant cloud-native infrastructure on AWS, Azure, or GCP. Additionally, you will build reusable IaC modules, enforce infrastructure standards, and drive cloud platform modernization. Key Responsibilities: - Lead infrastructure automation leveraging Infrastructure-as-Code (IaC) tools such as Terraform, Pulumi, AWS CloudFormation, ...
Posted 3 weeks ago
4.0 - 8.0 years
0 Lacs
karnataka
On-site
Role Overview: Smarsh is looking for a rigorous, problem-solving, and curious Platform Engineer (who codes!) to join the Fabric Insight group within the Observability team. The Fabric teams at Smarsh combine software and systems engineering to build and run products that equip engineering teams with secure tools and infrastructure. As a Platform Engineer, you will have a key role in shaping the future of the platform by developing tooling and providing technical expertise to design, deploy, and optimize services in a compliant and cost-effective manner in the cloud. The ideal candidate will have a programming background in a cloud environment, a strong understanding of cloud automation, Obse...
Posted 1 month ago
4.0 - 7.0 years
10 - 20 Lacs
hyderabad
Hybrid
Reliability Engineering • Define and measure SLOs, SLIs, and error budgets for key services. • Reduce operational load through automation and intelligent alerting. • Lead and participate in blameless post-incident reviews; turn learnings into improvements. • System Design & Scalability. Write tools and services in Python, Go, Save or similar languages to automate deployment, monitoring, and recovery.(Any scripting) • Build self-healing and auto-scaling systems. • Maintain high-quality documentation and runbooks through code generation and automation. • Observability & Incident Response • Develop deep insight into system performance through metrics, tracing, and logs. • Improve mean time to d...
Posted 1 month ago
5.0 - 9.0 years
0 Lacs
karnataka
On-site
As the Service Manager for Backup & Storage, your main responsibilities will include: - **Service Ownership & Strategy**: - Own the service vision and roadmap, aligning with the Digital Platform strategy and enterprise capability goals. - Translate strategy into measurable objectives and vendor requirements. - Continuously evolve the service based on performance data, incident learnings, and stakeholder feedback to improve stability, restore times, and user experience. - **Governance & Vendor Accountability**: - Lead the governance framework for outsourced/managed services including contractual follow-up, reviews, risk management, and financials. - Ensure suppliers are accountable for resili...
Posted 1 month ago
4.0 - 9.0 years
20 - 30 Lacs
mumbai, mumbai suburban, mumbai (all areas)
Hybrid
ISS STOXX is looking for a Senior Site Reliability Engineer to join our team in Mumbai (Goregaon East), India. Shift hours: Working hours (10 AM IST to 7 PM IST). This role expects rotational on-call support 24X7. Overview: This role is critical in ensuring the reliability, scalability and performance of our systems and services. As an Senior SRE, you will work at the intersection of software engineering and infrastructure, and driving continuous improvements in availability and performance. This is a high impact role ideal for individual who thrive in fast paced environments and enjoy solving complex technical challenges. Responsibilities: Assist the Principal SRE in driving the architectur...
Posted 1 month ago
10.0 - 16.0 years
0 Lacs
pune
Work from Office
Role & responsibilities Experience as SRE engineer in technology organization driving projects by defining and creation of CUJ, SLO, SLI, Error Budgeting. Strong experience in setting up observability, monitoring and self-healing solutions for instance with Grafana, Google Cloud Operations and Ansible / Terraform or comparable tools. Strong Knowledge on IAAC Terraform, GitHub, Docker Images Good understanding of SCM Tools Git, GitHub, SonarQube, Snyk Strong Hands on in scripting like Bash , PowerShell, Python , Ansible String Understanding of CI/CD pipelines, creation and maintenance, e.g., Cloud Build , Github, and GitHub Actions, Docker. Proactive attitude and collaborative Team player min...
Posted 2 months ago
6.0 - 11.0 years
15 - 25 Lacs
bengaluru
Hybrid
Primary Responsibilities Site Reliability Engineering (SRE) is an engineering discipline that combines software and system engineering to build and run large scale, massively distributed, fault-tolerant systems. SREs ensure managed service offerings and customer deployments have reliability and uptime appropriate to users needs and a fast rate of improvement while monitoring and validating capacity and performance. Focused on reliability, scalability, and the development of automation to manage a set of repetitive tasks at scale. Knowledge &Skills In depth knowledge on SRE practices and concepts like SLA, SLO, SLI, Error budget, Toil elimination, Post-mortem etc. Should have experience in an...
Posted 2 months ago
6.0 - 11.0 years
15 - 25 Lacs
pune
Hybrid
Primary Responsibilities Site Reliability Engineering (SRE) is an engineering discipline that combines software and system engineering to build and run large scale, massively distributed, fault-tolerant systems. SREs ensure managed service offerings and customer deployments have reliability and uptime appropriate to users needs and a fast rate of improvement while monitoring and validating capacity and performance. Focused on reliability, scalability, and the development of automation to manage a set of repetitive tasks at scale. Knowledge &Skills In depth knowledge on SRE practices and concepts like SLA, SLO, SLI, Error budget, Toil elimination, Post-mortem etc. Should have experience in an...
Posted 2 months ago
10.0 - 15.0 years
20 - 30 Lacs
hyderabad
Hybrid
About TechBlocks TechBlocks is a global digital product engineering company with 16+ years of experience helping Fortune 500 enterprises and high-growth brands accelerate innovation, modernize technology, and drive digital transformation. From cloud solutions and data engineering to experience design and platform modernization, we help businesses solve complex challenges and unlock new growth opportunities. At TechBlocks, we believe technology is only as powerful as the people behind it. We foster a culture of collaboration, creativity, and continuous learning, where big ideas turn into real impact. Whether you're building seamless digital experiences, optimizing enterprise platforms, or tac...
Posted 2 months ago
5.0 - 15.0 years
0 Lacs
karnataka
On-site
As a DevOps SRE Engineer at LTIMindtree, your role will involve monitoring, observability, reliability, SLI, and SLO. You will be responsible for utilizing monitoring tools such as Prometheus, Grafana, Dynatrace, Splunk, New Relic, and Datadog to ensure the smooth operation and performance of the systems. Qualifications required for this role include: - Overall 5-15 years of IT experience - Minimum of 5+ years of experience as a DevOps SRE Engineer If interested, please share your updated profile to madhuvanthi.s@ltimindtree.com.,
Posted 2 months ago
5.0 - 12.0 years
0 Lacs
bangalore, karnataka
On-site
Role Overview: As an experienced hands-on Cloud SRE Manager at Palo Alto Networks, you will lead high-severity incident and problem management across GCP-centric platforms. Your role involves a combination of deep technical troubleshooting and process ownership to ensure rapid recovery, root cause elimination, and long-term reliability improvements. You will be responsible for L3 OnCall duties, driving post-incident learning, and advocating for automation and operational excellence. Key Responsibilities: - Implement and lead post-mortem processes within SLAs, identify root causes, and drive corrective actions to reduce repeat incidents. - Rapidly diagnose and resolve failures across Kubernet...
Posted 2 months ago
0.0 years
4 Lacs
bengaluru, karnataka, india
On-site
Responsibilities : Evaluate and ensure availability of components within their teams and identify how to bring all services within SLO (99.XX) Monitor systems for implemented automation and set SLI/SLOs along with respective stakeholders. Implementation of observability platform Review all ownership data and ensure it is current and complete. Review volume and accuracy of bugs assigned to the team and identify opportunities to improve automated triage. Identify CFBT (Customer Flow Based Testing) eligible flows, develop CFBT tests and train the team on how to write and maintain them. Lead post postmortems for any P1 or greater incidents during the rotation. Train the team on distributed probl...
Posted 3 months ago
8.0 - 10.0 years
0 Lacs
pune, maharashtra, india
On-site
Position Overview Job Title: Senior Software Engineer (Typescript developer) Corporate Title: AVP Location: Pune, India Role Description You will be joining the TDI Engineering Platforms and Practice group as a full stack developer working on our target state secure pipelines and control automation stack. The pipeline is a key component in providing a frictionless software delivery experience for our customers and will be used by the entire organization. You will be responsible for designing, building and supporting a variety of automation including GitHub Actions and Workflows and backend process (Java/TypeScript) ensuring the highest standards of compliance without hindering the pace of de...
Posted 3 months ago
5.0 - 9.0 years
0 Lacs
hyderabad, telangana
On-site
As an Infrastructure & Platform Modernization Lead, you will be responsible for: - Leading infrastructure automation using Infrastructure-as-Code (IaC) tools such as Terraform, Pulumi, AWS CloudFormation, etc. - Designing and maintaining scalable, secure, and compliant cloud-native infrastructure on AWS, Azure, or GCP. - Building reusable IaC modules to enforce infrastructure standards across environments. - Driving cloud platform modernization and legacy system transformation. - Developing containerization and orchestration strategies with Docker and Kubernetes. - Implementing infrastructure testing and validation in CI/CD pipelines. In the Production Operations & Application Support role, ...
Posted 3 months ago
6.0 - 11.0 years
15 - 25 Lacs
pune
Hybrid
Primary Responsibilities Site Reliability Engineering (SRE) is an engineering discipline that combines software and system engineering to build and run large scale, massively distributed, fault-tolerant systems. SREs ensure managed service offerings and customer deployments have reliability and uptime appropriate to users needs and a fast rate of improvement while monitoring and validating capacity and performance. Focused on reliability, scalability, and the development of automation to manage a set of repetitive tasks at scale. Knowledge &Skills In depth knowledge on SRE practices and concepts like SLA, SLO, SLI, Error budget, Toil elimination, Post-mortem etc. Should have experience in an...
Posted 3 months ago
6.0 - 11.0 years
15 - 25 Lacs
bengaluru
Hybrid
Primary Responsibilities Site Reliability Engineering (SRE) is an engineering discipline that combines software and system engineering to build and run large scale, massively distributed, fault-tolerant systems. SREs ensure managed service offerings and customer deployments have reliability and uptime appropriate to users needs and a fast rate of improvement while monitoring and validating capacity and performance. Focused on reliability, scalability, and the development of automation to manage a set of repetitive tasks at scale. Knowledge &Skills In depth knowledge on SRE practices and concepts like SLA, SLO, SLI, Error budget, Toil elimination, Post-mortem etc. Should have experience in an...
Posted 3 months ago
4.0 - 9.0 years
20 - 30 Lacs
mumbai, mumbai suburban, mumbai (all areas)
Hybrid
ISS STOXX is looking for a Senior Site Reliability Engineer to join our team in Mumbai (Goregaon East), India. Shift hours: Working hours (10 AM IST to 7 PM IST). This role expects rotational on-call support 24X7. Overview: This role is critical in ensuring the reliability, scalability and performance of our systems and services. As an Senior SRE, you will work at the intersection of software engineering and infrastructure, and driving continuous improvements in availability and performance. This is a high impact role ideal for individual who thrive in fast paced environments and enjoy solving complex technical challenges. Responsibilities: Assist the Principal SRE in driving the architectur...
Posted 3 months ago
1.0 - 4.0 years
4 - 5 Lacs
bengaluru
Work from Office
1. Data Mapping: * From Booking Packing List to Booking Plan. * From the Final Packing List into the Planning Chart for Invoice Creation & Tracking. 2. E-Booking submission in Buyers/Forwarder’s Portal and handling Booking Amendments. 3. Tax Invoice Creation 4. Preparation of Sample Documents, including SLI. 5. Coordination with Factory Team for ASN submission in Buyer’s/Forwarder’s Portal.
Posted 3 months ago
5.0 - 7.0 years
15 - 20 Lacs
bengaluru
Hybrid
Job Title: Site Reliability Engineer (SRE) Experience Range: 5-7 Years Location: Bangalore / Hybrid Employment Type: Full-time About the Role We are seeking a skilled Site Reliability Engineer (SRE) to design, build, and maintain highly available, scalable, and reliable systems. The ideal candidate will have a strong background in infrastructure management, DevOps practices, cloud platforms, and observability tools . You will collaborate with cross-functional teams to ensure system stability, performance, and security while driving automation and continuous improvement initiatives. Key Responsibilities Software Development & Automation Develop, test, and maintain high-quality software soluti...
Posted 3 months ago
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
192783 Jobs | Dublin
Wipro
61786 Jobs | Bengaluru
EY
49321 Jobs | London
Accenture in India
40642 Jobs | Dublin 2
Turing
35027 Jobs | San Francisco
Uplers
31887 Jobs | Ahmedabad
IBM
29626 Jobs | Armonk
Capgemini
26439 Jobs | Paris,France
Accenture services Pvt Ltd
25841 Jobs |
Infosys
25077 Jobs | Bangalore,Karnataka