Jobs
Interviews

651 Sre Jobs - Page 17

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

12.0 - 18.0 years

35 - 60 Lacs

Hyderabad

Hybrid

Senior Manager, Site Reliability Engineering Hyderabad Shift Timings: 1.00 PM - 10.00 PM Duties and Responsibilities: People Leader Responsibility Position will manage 5 to 10 engineers both directly and indirectly. The engineers will include Site Reliability Engineers, Observability Engineers, Performance Engineers, DevSecOps Engineers, and others These individuals will vary from entry level to senior titles. Responsibilities: Lead and manage a team of Site Reliability Engineers, providing mentorship, guidance, and support to ensure the team's success. Develop and implement strategies for improving system reliability, scalability, and performance. Establish and enforce SRE best practices, including monitoring, alerting, error budget tracking, and post-incident reviews. Collaborate with software engineering teams to design and implement reliable, scalable, and efficient systems. Implement and maintain monitoring and alerting systems to proactively identify and address issues before they impact customers. Implement performance engineering processes to ensure reliability of Products, Services, & Platforms. Drive automation and tooling efforts to streamline operations and improve efficiency. Continuously evaluate and improve our infrastructure, processes, and practices to ensure reliability and scalability. Provide technical leadership and guidance on complex engineering projects and initiatives. Stay up-to-date with industry trends and emerging technologies in site reliability engineering and cloud computing. Other duties as assigned. Required Work Experience: 10+ years of experience in site reliability engineering or a related field. 5+ years of experience in a leadership or management role, managing a team of engineers. 5+ years of hands on working experience with Dynatrace (administrative, deployment, etc). Strong understanding of DevSecOps principles. Strong understanding of cloud computing principles and technologies, preferably AWS, Azure, or GCP. Strong communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams. Proven track record of driving projects to successful completion in a fast-paced, dynamic environment. Experience with driving cultural change in technical excellence, quality, and efficiency. Experience managing and growing technical leaders and teams. Constructing, interpreting, and applying metrics to your work and decision making, able to use those metrics to identify correlation between drivers and results, and using that information to drive prioritization and action Preferred Work Experience: Proficiency in programming/scripting languages such as Python, Go, or Bash. Experience with infrastructure as code tools such as Terraform or CloudFormation. Deep understanding of Linux systems administration and networking principles. Experience with containerization and orchestration technologies such as Docker and Kubernetes. Experience or familiarity with IIS, HTML, Java, Jboss. Knowledge: Site Reliability Engineering Principles DevSecOps Principles Agile (SAFe) Healthcare industry ITLT ServiceNow Jira/Confluence Skills: Strong communication skills Leadership Programming languages (see above) Project Management Mentorship Continuous learning

Posted 1 month ago

Apply

7.0 - 12.0 years

15 - 30 Lacs

Hyderabad

Work from Office

Site Reliability Engineer Required Technical Skill Set-: • Practical experience with Monitoring tools, such as: Grafana, Azure Monitor, Log Analytics, Network Monitoring and Alerting Tools (i.e. Big Panda). • Experience with Automation Tooling, such as: Azure Open AI, Amelia Automation, Service Now Orchestration, Power Apps / Power Platform, Python and PowerShell. Good foundational understanding of Agile Methodologies, AI/ML for automating operational initiatives and ITIL / Change Management processes. • Knowledge of core Azure Cloud computing concepts (AZ-900 Certification as a minimum requirement, with AZ-104 certification preferred). • Knowledge of Azure Chaos Studio for Chaos Engineering Minimum 5 mandate details are mandate with two or 3 liners 1. Implementing proactive remediation automation based on past issues / incidents and hypothesising use cases where an issue / incident may, thereby, automatically restoring stability, should the incident occur. 2. Track record of implementing Monitoring tooling to encompass: Health state of Infrastructure, Network, Log & Events, Performance, Capacity and Synthetic monitoring. 3. Experience in Data Correlation & Analysis and Configuring Alerts for detected issues / incidents. 4. Knowledge of Azure Open AI, and how various data sources can be integrated with the AI for data analysis, in order to initiate events based on informed decision making. 5. Experience in leading Blameless Post-Mortems, following production incidents / outages, in order to identify opportunities for improvement

Posted 1 month ago

Apply

3.0 - 5.0 years

13 - 17 Lacs

Bengaluru

Work from Office

Role Purpose The purpose of this role is to work with Application teams and developers to facilitate better coordination amongst operations, development and testing functions by automating and streamlining the integration and deployment processes Do Align and focus on continuous integration (CI) and continuous deployment (CD) of technology in applications Plan and Execute the DevOps pipeline that supports the application life cycle across the DevOps toolchain from planning, coding and building, testing, staging, release, configuration and monitoring Manage the IT infrastructure as per the requirement of the supported software code On-board an application on the DevOps tool and configure it as per the clients need Create user access workflows and provide user access as per the defined process Build and engineer the DevOps tool as per the customization suggested by the client Collaborate with development staff to tackle the coding and scripting needed to connect elements of the code that are required to run the software release with operating systems and production infrastructure Leverage and use tools to automate testing & deployment in a Dev-Ops environment Provide customer support/ service on the DevOps tools Timely support internal & external customers on multiple platforms Resolution of the tickets raised on these tools to be addressed & resolved within a specified TAT Ensure adequate resolution with customer satisfaction Follow escalation matrix/ process as soon as a resolution gets complicated or isnt resolved Troubleshoot and perform root cause analysis of critical/ repeatable issues Deliver No Performance Parameter Measure 1. Continuous Integration,Deployment & Monitoring 100% error free on boarding & implementation 2. CSAT Timely customer resolution as per TAT Zero escalation Mandatory Skills: DevOps. Experience3-5 Years.

Posted 1 month ago

Apply

7.0 - 12.0 years

7 - 17 Lacs

Hyderabad, Ahmedabad, Bengaluru

Hybrid

Job Title: Senior DevOps Site Reliability Engineer (SRE) Location: Hyderabad & Ahmedabad Employment Type: Full-Time Work Model - 3 Days from office Job Overview Dynamic, motivated individuals deliver exceptional solutions for the production resiliency of the systems. The role incorporates aspects of software engineering and operations, DevOps skills to come up with efficient ways of managing and operating applications. The role will require a high level of responsibility and accountability to deliver technical solutions. Summary: As a Senior SRE, you will ensure platform reliability, incident management, and performance optimization. You'll define SLIs/SLOs, contribute to robust observability practices, and drive proactive reliability engineering across services. Experience Required: 610 years of SRE or infrastructure engineering experience in cloud-native environments. Mandatory: Cloud: GCP (GKE, Load Balancing, VPN, IAM) Observability: Prometheus, Grafana, ELK, Datadog Containers & Orchestration: Kubernetes, Docker Incident Management: On-call, RCA, SLIs/SLOs IaC: Terraform, Helm Incident Tools: PagerDuty, OpsGenie Nice to Have: GCP Monitoring, Skywalking Service Mesh, API Gateway GCP Spanner, Scope: Drive operational excellence and platform resilience Reduce MTTR, increase service availability Own incident and RCA processes Roles and Responsibilities: Define and measure Service Level Indicators (SLIs), Service Level Objectives (SLOs), and manage error budgets across services. Lead incident management for critical production issues drive Root Cause Analysis (RCA) and postmortems. Create and maintain runbooks and standard operating procedures for high availability services. Design and implement observability frameworks using ELK, Prometheus, and Grafana; drive telemetry adoption. Coordinate cross-functional war-room sessions during major incidents and maintain response logs. Develop and improve automated System Recovery, Alert Suppression, and Escalation logic. Use GCP tools like GKE, Cloud Monitoring, and Cloud Armor to improve performance and security posture. Collaborate with DevOps and Infrastructure teams to build highly available and scalable systems. Analyze performance metrics and conduct regular reliability reviews with engineering leads. Participate in capacity planning, failover testing, and resilience architecture reviews. If you are interested , then please share me your updated resume to gopi.c@acesoftlabs.com or 9701923036

Posted 1 month ago

Apply

2.0 - 5.0 years

8 - 12 Lacs

Chennai

Work from Office

Join us in bringing joy to customer experience. Five9 is a leading provider of cloud contact center software, bringing the power of cloud innovation to customers worldwide, Living our values everyday results in our team-first culture and enables us to innovate, grow, and thrive while enjoying the journey together. We celebrate diversity and foster an inclusive environment, empowering our employees to be their authentic selves, The Team. Five9 is a leading provider of cloud software for the enterprise contact center market, bringing the power of the cloud to thousands of customers and facilitating more than three billion customer interactions annually. Since 2001, Five9 has led the cloud revolution in contact centers, helping organizations transition from legacy premise-based solutions to the cloud. Five9 provides businesses with cloud contact center software that is reliable, secure, compliant, and scalable, which is designed to create exceptional customer experiences, increase agent productivity, and deliver tangible business results, The Platform Infrastructure Team at Five9 is responsible for building and maintaining the Cloud Infrastructure that supports the development, deployment of software hosted by Five9. The platform infrastructure team provides critical Cloud infrastructure, tools and resources that enable software developers to build and deploy software more efficiently and effectively. This position is based out of one of the offices of our affiliate Acqueon Technologies in India, and will adopt the hybrid work arrangements of that location. You will be a member of the Acqueon team with responsibilities supporting Five9 products, collaborating with global teammates based primarily in the United States. Role Purpose. As part of the Cloud Platform Engineering team, you will be building Five9’s Classic on-prem and Modern SaaS platform, An ideal candidate for us is:. an experienced engineer who is passionate about building high performance cloud platforms with automation-first mindset,. a brilliant problem solver, and. a creative self-starter, How You Contribute. Be part of Cloud Platform Infrastructure Team, focused on building on-prem and hybrid-cloud solutions, Build automation capabilities towards common abstractions, tools, automation for CI/CD and progressive delivery of on-prem and Cloud Native applications, Enable all Five9 development teams with on-prem and Cloud Native developer workflow, conduct developer training and toolset to automate software delivery with a focus on Scale, HA, Design, and build secure, highly scalable, enterprise grade platform services, Document and communicate clearly of architecture and implementation solutions, Work closely with product managers, architects, testers, and development teams, Troubleshoot and support current Cloud platform in production, Expertise to Debug & Support Production issues, Skills, Competencies And Qualifications. Required:. 5+ years of professional DevOps / production operations experience, 3+ years of Cloud Native application delivery experience, Hands-on experience with Kubernetes, CI/CD tools like GitLab, TeamCity, Intimate knowledge of public cloud infrastructures (GCP Preferred, AWS, Azure), Hands-on experience working on core Cloud services – compute, storage, network, virtualization, Identity and Access Management (IAM), Experience in Infrastructure as Code (IaC), to be responsible for building robust platforms using automation, Knowledge of Linux based systems and runtimes, Development experience in one or more programming languages Python, Terraform, Java, etc,. Experience level in Current technology Stack:. Helm, K8S, Istio, GCP, AWS, GKE,EKS,Terraform,. SRE/DevOps practices or equivalent, Experience working with at least 1 or more Cloud Provider automation tools, Other Requirements. This position requires the ability to be On Call, Five9 embraces diversity and is committed to building a team that represents a variety of backgrounds, perspectives, and skills. The more inclusive we are, the better we are. Five9 is an equal opportunity employer, View our privacy policy, including our privacy notice to California residents here: https://www,five9,/pt-pt/legal, Note: Five9 will never request that an applicant send money as a prerequisite for commencing employment with Five9, Show more Show less

Posted 1 month ago

Apply

7.0 - 9.0 years

5 - 14 Lacs

Mumbai Suburban, Navi Mumbai, Mumbai (All Areas)

Hybrid

Hi All, We have an urgent opening for SRE/ Devops for one of our leading Investment Banking client in Mumbai location. Exp : 7 to 9 years Mainly to manage technical Transitions. Self Driven. SHould be able to create a good first impression. Good Communication skill to manage topics independently with onshore. Broad array of technical Knowledge mainly to Transition into production. Should have DEVOPS and SRE knowledge that can be leveraged on this assignment. If interested , please share your resumes to ashwini.shetty@kiya.ai

Posted 1 month ago

Apply

5.0 - 7.0 years

10 - 19 Lacs

Hyderabad

Work from Office

• 7+ years of experience in cloud infrastructure engineering or SRE roles. • Deep expertise in automating infrastructure using modern DevOps and IaC practices. • Proficient in building and maintaining CI/CD pipelines. • Strong background in microservices architecture and Docker. • Mid-level experience supporting Java or .NET applications. • Hands-on experience with cloud platforms such as AWS, Azure, or GCP. • Strong knowledge of networking, load balancing, and cloud security best practices. • Excellent analytical, problem-solving, and communication skills.

Posted 1 month ago

Apply

3.0 - 8.0 years

5 - 10 Lacs

Kolkata, Mumbai, New Delhi

Work from Office

Were making big foundational cloud infrastructure changes to make the experience faster, more reliable, and more scalable for our customers workloads. This role will be responsible for helping to build, maintain, and operate our new dynamic cloud infrastructure that powers all Firebolt services. About the day to day Design and implement systematic improvements to Firebolt cloud infrastructure and Engine provisioning services to make it fast, reliable, scalable and cost efficient. Collaborate with development teams across the company to improve services reliability, scalability and developer productivity. Together with an engineering team, you will share an on-call rotation and be an escalation contact for service and cloud infrastructure incidents. REQUIREMENTS BS degree in Computer Science, Engineering, or a related field or equivalent experience. 3+ years hands-on experience as a Site Reliability Engineer. 3+ years of production experience with Kubernetes including using open source solutions from the eco-system. 3+ years of proven experience as a professional developer of production software. Development experience in an object oriented programming language. We develop in Go, C++, and some Python here and there. Experience with these languages is a plus. You are willing to understand and make cross-cutting changes in the Firebolt codebase regardless of the language. Hands on experience in building and operating cloud native applications on AWS, GCP or Azure. Strong Linux fundamentals and an understanding of networking, including a variety of network protocols. Experience building and operating highly concurrent, highly available, and fault-tolerant distributed systems. A bonus if you have Understanding of application security in a cloud environment. Experience working with service mesh and multi-cluster mesh infrastructure (Cilium). Experience in monitoring a variety of different application types with a modern prometheus compatible observability stack. Experience working with CI/CD pipelines like GitHub actions. Experience working with ArgoCD, Terraform, FoundationDB, Kafka and Kubernetes operators is a plus.

Posted 1 month ago

Apply

5.0 - 10.0 years

15 - 30 Lacs

Gurugram, Delhi / NCR

Work from Office

Work Environment: This role involves rotational shifts on a weekly basis . Shift allowances will be provided as per company policy. Employees will also have the flexibility to work from home during night shifts to support convenience and continuity. Job Responsibilities: System Monitoring and Incident Management: Monitor the health and performance of critical systems, applications, and services. Respond to incidents, troubleshoot issues, and ensure timely resolution to minimize downtime and service disruptions. Automation and Scripting: Develop and maintain automation scripts and tools to streamline operational tasks, deployment processes, and infrastructure management. Infrastructure Management: Manage and scale the underlying infrastructure, including servers, cloud services, and network components. Implement best practices for configuration management, monitoring, and disaster recovery. Release Management: Collaborate with development teams to ensure smooth and reliable software releases. Participate in the design and implementation of deployment strategies. Performance Optimization: Identify performance bottlenecks and optimize the system to improve reliability and response times. Capacity Planning: Analyze system capacity and plan for future growth to meet increasing demands. Security and Compliance: Implement security best practices and ensure compliance with relevant industry standards and regulations. Collaboration and Documentation: Work closely with cross-functional teams, including developers, product managers, and operations, to ensure efficient communication and knowledge sharing. Document processes, procedures, and troubleshooting guides. On-Call Support: Participate in an on-call rotation to handle urgent issues and incidents outside regular business hours. Qualifications: Experience with Cloud Technologies: Proficiency in working with one or more cloud platforms like AWS, Google Cloud Platform, or Microsoft Azure. Programming and Scripting Skills: Strong knowledge of at least one programming language (e.g., Python, Java,) and experience with shell scripting. System Administration: Linux/Unix system hands on and good to have administration and networking concepts. Monitoring and Logging: Experience with monitoring tools such as Prometheus, Grafana, Nagios, and log management solutions like ELK stack. Infrastructure as Code (IaC): Knowledge of Infrastructure as Code tools like Terraform or CloudFormation. Automation and Configuration Management: Experience with tools like Ansible, Chef, or Puppet for automating infrastructure management. Version Control: Familiarity with version control systems like Git. Problem-Solving Skills: Ability to analyze and troubleshoot complex technical issues and can work with other teams to help and streamline Process. Communication Skills: Strong verbal and written communication skills to collaborate effectively with team members and stakeholders. KPI/Metrics: Understand Key SRE Metrics such as Availability, SLA/SLO, MTTA and MTTR Any hands on individual with BCA/MCA and B.Tech background.

Posted 1 month ago

Apply

2.0 - 7.0 years

10 - 14 Lacs

Noida

Work from Office

Mandatory skills SRE, LINUX, DevOps, Gitlab, Docker and Kubernetes We are looking for Support team member for the application which is MS Azure cloud based Application is Azure SQL Database which will source reference data from Datalake The SRE requirement for providing Infra support on Azure Cloud services So we are looking for people with Strong hands on experience on MS Azure Cloud services Must have experience on Azure SQL Database All the profiles which are shared till now doesnt have enough knowledge and experience on Azure Cloud Infra services

Posted 1 month ago

Apply

5.0 - 7.0 years

25 - 40 Lacs

Pune

Work from Office

Our world is transforming, and PTC is leading the way.Our software brings the physical and digital worlds together, enabling companies to improve operations, create better products, and empower people in all aspects of their business. Our people make all the difference in our success. Today, we are a global team of nearly 7,000 and our main objective is to create opportunities for our team members to explore, learn, and grow – all while seeing their ideas come to life and celebrating the differences that make us who we are and the work we do possible. Job Details As a senior SRE / Observability Engineer, you will be part of the Atlas Platform Engineering team and will: Create and maintain observability standards and best practices Review the current observability platform, identify areas for improvement, and guide the team in enhancing monitoring, logging, tracing, and alerting capabilities. Expand the observability stack across multiple clouds, regions, and clusters, managing all observability data. Design and implement monitoring solutions for complex distributed systems to provide deep insights into systems and services aiming at complete visibility of digital operations Supporting the ongoing evaluation of new capabilities in the observability stack, conducting proof of concepts, pilots, and tests to validate their suitability. Assist teams in creating clear, informative, and actionable dashboards to improve system visibility. Automate monitoring and alerting processes, including enrichment strategies and ML-driven anomaly detection where applicable. Provide technical leadership to the observability team with clear priorities ensuring agreed outcomes are achieved in a timely manner. Work closely with R&D and product development teams (understand their requirements and challenges) to ensure seamless visibility into system and service performance. Work closely with the Traffic Management team to identify and standardise on existing and new observability tools as part of a holistic solution Conduct training sessions and create documentation for internal teams Support the definition of SLI (service level indicators) and SLO (service level objectives) for the Atlas services. Keep track of the error budget of each service Participate in the emergency response process Conduct RCAs (root cause analysis) Help to automate repetitive tasks and reduce toil. Qualifications: People and communication qualifications Be a strong team player Have good collaboration and communication skills Ability to translate technical concepts for non-technical audiences Problem-solving and analytical thinking Technical qualifications - general: Familiarity with cloud platforms (Ideally Azure) Familiarity with Kubernetes and Istio as the architecture on which the observability and Atlas services run, and how they integrate and scale. Experience with infrastructure as code and automation Knowledge of common programming languages and debugging techniques Have a strong technical background and be hands on. Linux and scripting languages (Bash, Python, Golang). Significant Understanding of DevOps principles. Technical qualifications - observability Strong understanding of observability principles (metrics, logs, traces) Experience with APM tools and distributed tracing Proficiency in log aggregation and analysis Knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, Prometheus, Grafana, Datadog, New Relic, Sumologic, ELK Stack, or others Knowledge of Open Telemetry, including OTEL collector and code instrumentation Experience designing and building unified observability platforms that enable the use of data (metrics, logs, and traces) to determine quickly if their application or service is operating as desired. Technical qualifications – SRE Understanding of the Google SRE principles Experience in defining SLIs and SLOs Experience in performing RCAs (root cause analysis) Experience in system performance Experience in incident response Knowledge of status tools, such as Atlassian Status Page or similar Knowledge of incident management and paging tools, such as PagerDuty or similar Knowledge of ITIL (Information Technology Infrastructure Library) processes Qualifications: People and communication qualifications • Be a strong team player • Have good collaboration and communication skills • Ability to translate technical concepts for non-technical audiences • Problem-solving and analytical thinking Technical qualifications - general: • Familiarity with cloud platforms (Ideally Azure) • Familiarity with Kubernetes and Istio as the architecture on which the observability platform runs, and how they integrate and scale. • Experience with infrastructure as code and automation • Knowledge of common programming languages and debugging techniques • Have a strong technical background and be hands on. • Linux and scripting languages (Bash, Python, Golang). • Significant Understanding of DevOps principles. Technical qualifications - observability • Strong understanding of observability principles (metrics, logs, traces) • Experience with APM tools and distributed tracing • Proficiency in log aggregation and analysis • Knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, Prometheus, Grafana, Datadog, New Relic, Sumologic, ELK Stack, or others • Knowledge of Open Telemetry, including OTEL collector and code instrumentation • Experience designing and building unified observability platforms that enable the use of data (metrics, logs, and traces) to determine quickly if their application or service is operating as desired. Life at PTC is about more than working with today’s most cutting-edge technologies to transform the physical world. It’s about showing up as you are and working alongside some of today’s most talented industry leaders to transform the world around you. If you share our passion for problem-solving through innovation, you’ll likely become just as passionate about the PTC experience as we are. Are you ready to explore your next career move with us? We respect the privacy rights of individuals and are committed to handling Personal Information responsibly and in accordance with all applicable privacy and data protection laws. Review our Privacy Policy here ."

Posted 1 month ago

Apply

10.0 - 15.0 years

6 - 10 Lacs

Hyderabad

Work from Office

Infrastructure Engineering & Coud Operations (IECO) is evoving into a word-cass, coud-optimized organization focused on deivering secure, scaabe, and high-performing patforms. As we transition from co-ocated environments to modern coud soutions, we are pacing a heightened emphasis on vunerabiity management, patch compiance, and infrastructure security.As a DevOps Manager within IECO, you wi ead a team of engineers with a core mission to ensure the security and resiience of our coud infrastructure. You wi drive the impementation of robust vunerabiity and patch management programs, ensuring timey remediation of security risks whie maintaining operationa exceence. Your eadership wi be instrumenta in advancing automation, improving system reiabiity, and safeguarding customer trust.You must be a proactive, resuts-driven eader who thrives in dynamic environments. You bring a security-first mindset, a passion for continuous improvement, and the abiity to mentor and inspire high-performing teams. What you do Buid and ead a high-performing team focused on vunerabiity detection, assessment, and remediation across coud and hybrid environments.Oversee the end-to-end patch management ifecyce, ensuring timey depoyment of security patches and updates across a infrastructure components.Estabish and enforce security baseines and compiance standards, integrating them into CI/CD pipeines and infrastructure as code.Monitor and anayze vunerabiity metrics and patch compiance KPIs, using data to drive continuous improvement and risk reduction.Coaborate with Security, Risk, and Compiance teams to aign on threat inteigence, audit requirements, and remediation strategies.Lead incident response efforts reated to infrastructure vunerabiities, ensuring rapid containment and resoution.Drive automation initiatives to streamine vunerabiity scanning, patch depoyment, and compiance reporting.Provide technica eadership in coud infrastructure design, ensuring security is embedded in architecture and operations.Partner with Product Management and Appication Engineering to aign infrastructure security with product roadmaps and business goas.Manage 24/7 operations, ensuring high avaiabiity, performance, and security of critica systems.Create and maintain documentation for systems, processes and procedures to ensure knowedge sharing across teamsStay updated on industry trends and emerging technoogies What we want you to have: Bacheors degree in Computer Science, Engineering, Information Security, or reated fied (or equivaent experience).10+ years of experience in IT infrastructure, DevOps, or SRE roes with a strong focus on security and patch management.Proven experience impementing and managing vunerabiity management toos (e.g., Quays, Tenabe, Rapid7) and patch management soutions (Tanium).Hands-on experience with coud patforms (AWS, Azure, GCP) and container orchestration (Docker, Kubernetes).Famiiarity with DevSecOps practices, infrastructure as code (Terraform, Ansibe), and secure CI/CD pipeines.Strong understanding of ITIL, security frameworks (NIST, CIS), and compiance standards (SOC 2, ISO 27001).Exceent communication and eadership skis, with experience managing geographicay distributed teams.Avaiabiity for on-ca support during critica incidents or high-impact events. Stay up to date on everything Backbaud, foow us on Linkedin, X, Instagram, Facebook and YouTube Backbaud is proud to be an equa opportunity empoyer and is committed to maintaining an incusive work environment. A quaified appicants wi receive consideration for empoyment without regard to race, coor, reigion, gender, gender identity or expression, sexua orientation, nationa origin, physica or menta disabiity, age, or veteran status or any other basis protected by federa, state, or oca aw.

Posted 1 month ago

Apply

3.0 - 7.0 years

3 - 7 Lacs

Hyderabad / Secunderabad, Telangana, Telangana, India

On-site

You will be responsible for understanding requirements or SRE goals in depth from both tech and business perspectives You will provide solutions to improve reliability, including identifying and implementing mechanisms and architectures that enable fault tolerance and faster median time to respond and median time to detect You will be responsible for enhancing the incident management process, including the development of an incident prioritization matrix, triage, communication, mitigation, post-mortem analysis and implementation of corrective actions You will manage client stakeholder expectations and queries during production incidents, providing detailed technical analysis of issues and remediation plans for mitigation and prevention in future, and act as the interface for C-level executives, if or when needed You will be a liaison with client engineering teams, build trust and productive relationships with senior client stakeholders and team leads to influence them in making better decisions You will be responsible for identifying opportunities for enhancing system performance and reliability in alignment with business SLAs, SLOs, KPIs and objectives, and provide guidance and assistance to SRE teams in implementing the identified improvements As an SRE expert, you will collaborate with Thoughtworks application development leads and solution architects, recommending changes in system design and adopting best practices for improved reliability from day one You will oversee and mentor other SREs on the team, contributing to their growth and development Job qualificationsTechnical SkillsYou can program with one or more high-level languages such as Python, Golang, Shell scripting, Ruby or Java You are familiar with DevOps and GitOps practices, driving the integration of observability automation into CI/CD pipelines, e.g.: GitLab, Jenkins, CircleCI or equivalent You have in-depth knowledge of configuration management and Infrastructure as Code (IAC) tools such as Terraform, Ansible, ARM and CloudFormation for provisioning and managing infrastructure You have an expertise in observability, logs, tracing and monitoring tools such as Grafana (Loki and Tempo), Prometheus, Graylog, Jaeger, Zipkin, ELK stack or equivalent You have a strong understanding of container-based architecture and hands-on experience with orchestration tools such as Kubernetes, AWS EKS, Docker Swarm, Nomad, etc. You have in-depth experience in application and infrastructure performance tuning and scaling to handle heavy loads under different scenarios e.g.: Periodic traffic load and tsunami patterns You have a good understanding of essential concepts such as quality gates encompassing SLI/SLO/SLA, chaos engineering, golden signals, blameless postmortem methodologies, synthetic monitoring, distributed tracing, end-user monitoring and performance testing You have experience with network load balancing, security tech stacks, Transport Layer Security (TLS) and certificate management, and an understanding of standard networking protocols and configurations Professional SkillsYou have strong communication and articulation skills, and are proficient in English You are able to convey resolutions to audiences with varying degrees of technical/business proficiency and bring them to consensus You have excellent problem-solving and analytical skills, with a focus on continuous improvement You have good listening and presentation skills You solve challenging problems and difficult to debug issues with a never give up attitude You can collaborate with cross-functional engineering teams to conduct capacity planning and scalability assessments, and design solutions for handling current and future growth You have the ability to work under pressure, with composure, during production incidents You understand requirements provided by the client on both technical and business aspects, and can break them down for successful implementation

Posted 1 month ago

Apply

10.0 - 15.0 years

0 Lacs

Hyderabad, Pune, Gurugram

Work from Office

Analysis of issues via Splunk (including Splunk APM and Splunk O11y), AppDynamics, Grafana, RedMetrics, 1000Eyes Debugging of issues in VMs, Load balancers, Firewalls, API Gateways, DB, Network, Linux / Unix Debugging of issues in Containerization, Docker, Kubernetes, AWS, PCF, Azure Analysis of issues via APM, NMON , Wireshark usage and analysis Database performance monitoring and analysis Experience in UEM and synthetic monitoring set up

Posted 1 month ago

Apply

1.0 - 11.0 years

36 - 57 Lacs

, New Zealand

On-site

URGENT HIRING !!! location's : Canada , Australia , New Zealand ( Not In India ) Benefits : Medical Insurances , Travel allowances , Flight Tickets , Meals , etc For more information call or whatsapp +91 9220850077 Key Responsibilities: Ensure the availability, performance, reliability, and scalability of applications and services. Work collaboratively with Software Engineering to define infrastructure and deployment requirements. Proactively identify and resolve production issues and develop tools, scripts, and frameworks to build and maintain operational efficiency. Conduct routine application performance reviews and incident post-mortems to prevent future outages and improve system reliability. Participate in on-call rotations, demonstrating problem-solving and decision-making abilities to ensure quick resolution of problems. Develop and maintain monitoring tools and alerting systems. Improve CI/CD pipelines, automate routine tasks, and enhance system security. Document all procedures and policies related to the managed systems.

Posted 1 month ago

Apply

6.0 - 13.0 years

6 - 13 Lacs

Hyderabad / Secunderabad, Telangana, Telangana, India

On-site

6-13 years of experience L2 Production Support SRE MS/SQL Server (ok if candidate has SQL,PL/SQL knowledge and has not directly worked on MS SQL) Cloud Dynatrace (or similar alerting and monitoring tools) Job Description What you d be doing in this role supporting the set of applications in Finance Platform Working as Technical Application Service Specialist (TASS), focusing on alerting, monitoring and observability. Ensuring that applications are up and running. Taking preventive actions to avoid incidents/application downtimes. Observability, investigating & fixing service performance issues, with an engineering mentality - resolving via code changes and implementing improvements to prevent repeat issues. Implementing further automation and reducing toil, by utilising existing tooling or implementing new technologies Communicate with Business to understand requirements, socialise ideas amongst peers, respond to user queries. Assess query performance and actively contribute to optimising the code Managing the resolution of issues in Production promptly and effectively Identifying opportunities to make improvements Working closely with the Lab team to understand their backlog changes that will be coming through to RTB Support. Support internal audit by submitting required evidence What you ll need: 7+ years working exp in an Enterprise environment (Financial organisation preferred). Experience with SQL queries, stored procedures, database design and performance tuning (MS SQL Server, Entity Framework, Dapper). Proficient in Transact SQL (T-SQL), SQL Server Integration Services (SSIS) & SQL Server Reporting Services (SSRS). Table/index design and Query Optimisation experience. Hands-on working experience of ETL tools like DataStage, Informatica. Good understanding on DevOps, including experience of Infrastructure as Code and CI/CD pipelines, such as Terraform, Spinnaker and Jenkins Well versed using GIT as Source Control Management tool. Good understanding of Event-driven architecture and Cloud system architectures. Hands-on experience of debugging C# code in SSIS Script components. Basic understanding of .Net Framework. Be proficient in the monitoring tooling such as Dynatrace, Stack Driver/Cloud Operations Suite Highly developed problem-solving skills with minimal supervision. Excellent written and verbal communication skills in English Out of hours On-Call support is required for our applications, and will be managed via a Rota or Shift pattern

Posted 1 month ago

Apply

14.0 - 19.0 years

14 - 18 Lacs

Bengaluru

Work from Office

Position Summary This position manages the activities of systems development, applications development, test strategies and quality assurance functions for system enhancements and new products. Key responsibilities are to develop and manage people, provide technical leadership, lead project planning, facilitate communication, and offer product vision. Coordinates project timelines with Project and Development Managers, determines and obtains resources, assigns work, monitors progress and results, and provides technical leadership. This Manager is a champion for product quality within the department and is accountable for an assessment of product readiness and commitments on product delivery schedules. This is a first-level management position. Primary Responsibilities Employee management including but not limited to sourcing, interviewing and hiring candidates for open positions,onboarding, establishing goals, assigning or delegating work, providing on-the-job training, giving guidance to staff, conducting performance evaluations, approving paid time-off (PTO), developing performance improvement plans, and taking disciplinary action. Recommends changes to policies and establishes procedures that affect immediate organization(s). Act as an advisor to subordinate(s) to meet schedules and/or resolve technical problems. Ensures milestones are being met; monitor, track and make visible Develops and administers schedules, performance requirements; provides input into budgeting. May meet with customers to communicate and review product features May communicates product roadmaps and project status to staff, senior management, and other product teams. Evaluates and reviews new technologies on their applicability to product architecture and design. Prioritizes product features resulting in the correct delivery of needed functionality. Coordinates with development service groups resulting in greater communication and higher probability of on time delivery of products. Responsible for upholding F5s Business Code of Ethics and for promptly reporting violations of the Code or other company policies. Performs other related duties as assigned. Evaluate and solve software failures Improve the existing functionality Work cross functionally integrating, testing and debugging issues with existing system wide software Collaborate with team members and technical leads Build tools and infrastructure to improve F5s components and features Perform other related duties as assigned Knowledge, Skills and Abilities Essential: Excellent analytic trouble-shooting and debugging skills Demonstrated excellence in written and verbal communications Programming efficiency Strong networking fundamentals and experience dealing with different layers of the networking stack. Experience with network and web technologies such as TCP, UDP, IP, HTTP, L4-L7, DNS and such SRE/Devops on Linux & Kubernetes: Demonstrate excellent, hands-on knowledge of deploying workloads and managing lifecyle on kubernetes, with practical experience on debugging issues. On-call Experience in managing everyday OPs for production environments. Experience in production alerts management and using dashboards to debug issues. Cloud Infrastructure: Prior experience in deploying workloads and managing lifecycle on any cloud provider (AWS/GCP/Azure) Knowledge and expertise in software engineering methodologies. Demonstrated ability to lead technical teams Good working experience in Cloud based product development. Good knowledge of microservices architecture and API design and development best practices Working knowledge of development and deployment across multiple cloud providers such as Amazon Web Services, Microsoft Azure, Google Cloud Platform, VMWare and OpenStack Knowledge or experience with Docker containers and orchestration platforms such as Kubernetes Able to collaborate and thrive in a dynamic environment Passion for learning new technologies, and a track record of doing so Track record of mentoring engineering staff Proven ability to deliver products with highest quality, on time and within budget. Demonstrated ability in mentoring and developing direct reports. Extensive experience with bug tracking and triage systems Excellent interpersonal and communication skills. Demonstrated excellence in all written communications. Duties may require being on call periodically or working outside normal working hours (evenings and weekends). Duties may require the ability to travel via automobile or airplane, approximately 10% of the time spent traveling. Nice-to-have: Experience programming in Linux networking and OS internals Agile based software development methodologies such as Kanban, Scrum GipOps: Experience with helm charts/customizations and gitops tools like ArgoCD/FluxCD. Experience with Disaster Recovery and Migration is a plus Qualifications Typically requires a minimum of 14 years of related experience with a Bachelors degree; or 12 years and a Masters degree; or a PhD with 10 year of experience; or equivalent work experience. Environment Empowered Work Culture: Experience an environment that values autonomy, fostering a culture where creativity and ownership are encouraged. Continuous Learning: Benefit from the mentorship of experienced professionals with solid backgrounds across diverse domains, supporting your professional growth. Team Cohesion: Join a collaborative and supportive team where you'll feel at home from day one, contributing to a positive and inspiring workplace.

Posted 1 month ago

Apply

12.0 - 22.0 years

10 - 20 Lacs

Pune

Hybrid

SRE, LINUX, DevOps, Gitlab, Docker and Kubernetes We are looking for Support team member for the application which is MS Azure cloud based Application is Azure SQL Database which will source reference data from Datalake The SRE requirement for providing Infra support on Azure Cloud services So we are looking for people with Strong hands on experience on MS Azure Cloud services Must have experience on Azure SQL Database All the profiles which are shared till now doesnt have enough knowledge and experience on Azure Cloud Infra services

Posted 1 month ago

Apply

5.0 - 10.0 years

40 - 100 Lacs

Pune, Bengaluru, Delhi / NCR

Hybrid

Experience in Site Reliability Engineering, DevOps,managing teams, including mentoring and developing engineers.Prometheus, Grafana, ELK Stack, Splunk, Datadog, New Relic, AWS, GCP, Azure,Docker, Kubernetes,Python, Go, Bash, or simila.

Posted 1 month ago

Apply

5.0 - 8.0 years

25 - 37 Lacs

Thiruvananthapuram

Work from Office

What youll do? Design, develop, and operate high scale applications across the full engineering stack. Design, develop, test, deploy, maintain, and improve software. Apply modern software development practices (serverless computing, microservices architecture, CI/CD, infrastructure-as-code, etc.) Work across teams to integrate our systems with existing internal systems, Data Fabric, CSA Toolset. Participate in technology roadmap and architecture discussions to turn business requirements and vision into reality. Participate in a tight-knit, globally distributed engineering team. Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on network, or service operations and quality. Research, create, and develop software applications to extend and improve on Equifax Solutions. Manage sole project priorities, deadlines, and deliverables. Collaborate on scalability issues involving access to data and information. Actively participate in Sprint planning, Sprint Retrospectives, and other team activity What experience you need Bachelor's degree or equivalent experience 5+ years of relevant software engineering experience 5+ years experience writing, debugging, and troubleshooting code in mainstream Java, SpringBoot, TypeScript/JavaScript, HTML, CSS 5+ years experience with Cloud technology: GCP, AWS, or Azure 5+ years experience designing and developing cloud-native solutions 5+ years experience designing and developing microservices using Java, SpringBoot, GCP SDKs, GKE/Kubernetes 5+ years experience deploying and releasing software using Jenkins CI/CD pipelines, understand infrastructure-as-code concepts, Helm Charts, and Terraform constructs What could set you apart Self-starter that identifies/responds to priority shifts with minimal supervision. Experience designing and developing big data processing solutions using Dataflow/Apache Beam, Bigtable, BigQuery, PubSub, GCS, Composer/Airflow, and others UI development (e.g. HTML, JavaScript, Angular and Bootstrap) Experience with backend technologies such as JAVA/J2EE, SpringBoot, SOA and Microservices Source code control management systems (e.g. SVN/Git, Github) and build tools like Maven & Gradle. Agile environments (e.g. Scrum, XP) Relational databases (e.g. SQL Server, MySQL) Atlassian tooling (e.g. JIRA, Confluence, and Github) Developing with modern JDK (v1.7+) Automated Testing: JUnit, Selenium, LoadRunner, SoapUI Cloud Certification strongly preferred

Posted 1 month ago

Apply

2.0 - 7.0 years

2 - 7 Lacs

Gurgaon / Gurugram, Haryana, India

On-site

Dynamic Yield, a Mastercard company, is committed to connecting and powering an inclusive, digital economy that benefits everyone. Our SSO Data Science team, specifically the Horizontal Data Science Enablement Team, is seeking a Senior MLOps Engineer . This role is crucial for solving complex MLOps problems, managing our organization's Databricks platform, building robust CI/CD and automation pipelines, and championing best practices. If you're passionate about optimizing the machine learning lifecycle and ensuring platform stability, we encourage you to join our team and contribute to cutting-edge data science enablement. All About You As a Senior MLOps Engineer, you will: Databricks Platform Management: Assist in the administration, configuration, and maintenance of Databricks clusters and workspaces . Monitor Databricks clusters for high workloads or excessive usage costs, and promptly alert relevant stakeholders to address issues impacting overall cluster health. Implement and manage security protocols, including access controls and data encryption, to safeguard sensitive information in adherence with Mastercard standards. Facilitate the integration of various data sources into Databricks, ensuring seamless data flow and consistency. Identify and resolve issues related to Databricks infrastructure, providing timely support to users and stakeholders. MLOps Solution Development: Assist in the development of MLOps solutions, including but not limited to: Model monitoring Feature catalog/store Model lineage maintenance CI/CD pipelines to gatekeep model lifecycle from development to production. Operational Excellence: Maintain services once they are live by measuring and monitoring availability, latency, and overall system health. Apply a systematic problem-solving approach to all challenges. What Experience You Need Education: Master's degree in computer science, software engineering, or a similar field. Databricks Expertise: Strong experience with Databricks and its management of roles and resources. MLOps Tools: Experience with MLOps solutions like MLFlow . Data Skills: Experience with performing data analysis, data observability, data ingestion, and data integration. DevOps/SRE Background: 2+ years of DevOps, SRE, or general systems engineering experience. CI/CD Proficiency: 2+ years of hands-on experience in industry-standard CI/CD tools like Git/BitBucket, Jenkins, Maven, Artifactory, and Chef. Programming: Strong coding ability in Python or other languages like Java and C++, plus a solid grasp of SQL fundamentals. Communication: Strong communication skills (both written and verbal). What Could Set You Apart SQL Tuning: Experience with SQL tuning . Automation: Strong automation experience. Global Operations: Ability to operate in a 24x7 environment encompassing global time zones.

Posted 1 month ago

Apply

10.0 - 15.0 years

10 - 12 Lacs

Bengaluru / Bangalore, Karnataka, India

On-site

RESPONSIBILITIES & QUALIFICATIONS BTech/BE/MTech in Computer Science with minimum 6 years of experience Technologies Hands-on developer experience with an awareness of below skills Design and development of web based application using Java/J2EE, REST, Relational and NOSQL databases. Cloud Technologies - AWS , Azure Databases - DB2, Sybase IQ, Mongo DB Programming - Java, Python, Shell script, Terraform Messaging - Kafka, RMQ Frameworks - Spring boot, Spring cloud Site Reliability Engineering (SRE) - Prometheus, Grafana UI - ReactJS, Visualization libraries BI Tools - Alteryx, Tableau, Qlik Sense, Power BI Container - Docker, Kubernetes Preferred Qualifications: 8+ year of industry experience with focus on Technical Architecture, Project management and leadership skills in a fast paced Agile environment. Stakeholder management - experience working with business or clients to transform requirements, provide updates and manage expectations Strong Analytical and Problem solving skills Experience with continuous delivery and deployment practices- preferred experience on Git pipelines. Advocate of strong engineering practices and required to run and maintain a robust engineering plant with SRE, Operational Readiness Experience working with cloud infrastructure and SaaS solutions in a hybrid Cloud environment.

Posted 1 month ago

Apply

7.0 - 13.0 years

10 - 45 Lacs

Hyderabad / Secunderabad, Telangana, Telangana, India

On-site

UK Based MNC Our new technology center in Hyderabad will be home to highly skilled technology and data specialists who will be driving our transformation and delivering great outcomes for UK Based Banking Group's customers. Our office is situated in a sought-after location that features easy transport links and excellent facilities, all aimed at enabling you to achieve a great work-life balance. Working with us means being part ofour aspirational and transformative journey of redefining the fintech landscape, while building an organization that welcomes all. We're committed to providing an exceptional employee experience through our policies, practices, and development opportunities to support you in achieving your potential. This is a once in a career opportunity to shape your future and help us make our mark in India.Are you ready to help shape your future, as well as ours Join us and grow with purpose. What is needed to be considered for this job Essential criteria: Experience using SRE practices would be preferable GCP cloud products such as GKE and spanner; Dataflow and more traditional products such as Java, LINUX, IBM WebSphere, Liberty, Oracle, DB2 Hands on experience in Kubernetes, Terraform & Helm charts. Scripting using bash, shell, python Experience using DevOps tooling such as Jenkins, Harness/GitOps Experience using monitoring tools such as Splunk and Dynatrace Service mindset acquired through supporting critical applications Flexible to work in 24x7 shifts as part of rota including on call and weekends. Experience using Service management tools such as Service NOW Experience supporting applications in a public cloud environment such as GCP & Azure would be an advantage. A passion to automate everything, everywhere! Strong problem-solving abilities in a complex multi-tier technical environment You will be a great communicator across different stakeholder groups You'll want to champion the Service culture and help drive adoption across our engineering labs You'll be able to work independently, use initiative, without management oversight. Experience of applying SRE principles such as: oService Level Management (SLIs, SLOs and SLAs) oBlameless Post Mortems oTOIL Reduction via Automation oCapacity and Cost Planning oChaos Engineering oObservability oProduction Readiness Reviews oIncident and release management So what can we offer you in return Whatever your aspiration, you can also expect excellent benefits, personal development, and a career that is enriching and full of opportunity. You will also receive a package that includes: private medical insurance, discretionary bonus, various share plans, a pension where we contribute up to 15%. 30 days holiday, plus 9 bank holidays and cash sum of 4% which you can exchange for a variety of benefits, or simply take the cash. Our flexible work options mean your work/life balance can be preserved while fully paid training and certification programmes ensure your skills remain fresh. We are an equal opportunities employer and are delighted to receive applications from people of all backgrounds to join our diverse team. If you have particular needs regarding working patterns, childcare, or anything else we can do to accommodate you, please don't shy away from talking to us - we're here to support you and develop your career and your happiness is of the utmost importance to us. We ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform crucial job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation. Our continued dedication to helping Britain recover means that as a colleague you can make a difference to customers, businesses, and communities. Together we have a key role to play in shaping the bank of the future, whilst the scale and reach of our Group means you will continue to have opportunities to learn, grow and develop. We are passionate about creating a values-led culture, and our approach to inclusion and diversity means that we all have the chance to create a real difference, together. Are you interested in joining us Apply today, we'd love to hear from you... Together we'll make it Our continued commitment to helping Britain prosper means that as a colleague you can make a difference to customers, businesses and communities. Together we have a key role to play in shaping the bank of the future, whilst the scale and reach of our Group means you'll continue to have opportunities to learn, grow and develop. We're focused on creating a values-led culture, and our approach to inclusion and diversity means that we all have the opportunity to make a real difference, together. As part of the Group's commitments as a result of ring-fencing legislation, colleagues based in the Islands are required to be exclusively dedicated to the non-ring-fenced bank and its subsidiaries. This means that colleagues who are based in the Islands would not be able to undertake roles for the Ring Fenced Bank from their existing location and would need to consider relocation when applying for roles.

Posted 1 month ago

Apply

5.0 - 8.0 years

11 - 18 Lacs

Hyderabad, Chennai, Bengaluru

Hybrid

AWS DevOps Senior Engineer Who We Are: (www.credera.com) Credera, trading as TA Digital, is a global consulting firm that combines transformational consulting capabilities, deep industry knowledge, and AI and technology expertise to deliver valuable customer experiences and accelerated growth across a broad range of industries worldwide. Our one-of-a-kind global boutique approach means we provide our clients with tailored solutions unique to their organization that can scale due to our extensive footprint. As a values-led organization, our mission is to make an extraordinary impact on our clients, our people, and our community. We believe it is this approach that has allowed us to work with and transform the most influential brands and organizations in the world, from strategy through to execution. More information is available at www.credera.com. We are part of the OPMG Group of Companies, a division of Omnicom Group Inc. Location: Hyderabad / Bangalore / Chennai / Gurugram Work Mode: Hybrid (3 Days per week from Office) About the team you will join: We are an AWS Advance Consulting Partner and Microsoft Azure Gold Partner. Our team has extensive expertise in architecting go-to-market solutions and helps clients leverage Cloud to the best of its capabilities. Our Cloud Services Strategy and Consulting services help clients understand how cloud automation can achieve the business goals. Our DevOps & Cloud Solutions team helps clients build highly reliable, scalable and secure infrastructure with DevOps best practices. This team enhances Cloud capabilities by improving performance, security, dependability and management. Description Job Summary: As a DevOps senior Engineer, you will design, deploy, and manage advanced cloud-based solutions on AWS. This role requires an experience with a strong background in AWS cloud management, DevSecOps, and infrastructure as code (IaC). The ideal candidate will have at least 5 years of relevant experience and will be proficient in implementing and managing AWS networking components, security practices, and CI/CD pipelines. Excellent communication skills and the ability to build effective working relationships are essential, along with the ability lead a team of engineers Key Responsibilities: Provide technical tools, test environments, processes, and development support to the client team. Deploying, automating, maintaining, and managing AWS cloud-based production systems, to ensure the availability, performance, scalability, and security of production systems. Build and operate client production environments. Participate in our evolution toward infrastructure as code. Track usage and capacity plan for hardware and software on an annual basis Mentor junior team members in the construction of delivery systems using configuration management and horizontally scalable architectures. Participate in on-call escalation for client production systems. Participate in on-call rotation for DevTools systems. Required Skillset: 4+ years of experience as a DevOps engineer. Strong knowledge in fundamental DevOps tools such as Terraform, Jenkins, GoCD, GitHub, Nagios, Puppet, Chef, etc. Advanced knowledge and experience with Kubernetes, Amazon managed services and serverless frameworks. Advanced knowledge and experience with GitHub Actions. Experience managing Linux-based operating systems. Fantastic communication and documentation skills both written and verbal. Expertise in automation scripting Experience with one of the dynamic object-oriented programming languages like Groovy, Ruby or Python Detailed experience with AWS. Professional Attributes You Possess: Excellent communication skills. Making decisions and solving problems involving varied levels of complexity, ambiguity and risk. Providing a win-win environment to inspire, attract and develop the best Talents. Boosting teams and individuals potential to improve their engagement, performance and career experience. Questioning conventional approaches, exploring alternatives and responding to challenges with innovative solutions or services, using intuition, experimentation, expertise and fresh perspectives. Problem Solving Attitude and Passion to solve Customer Problems. Facilitating efficiency, speed and simplification in all actions. Adapting to market changes and competitive pressures and enables fast learning. Demonstrating passion for our business and constant hunger for outstanding performance.

Posted 1 month ago

Apply

6.0 - 9.0 years

5 - 15 Lacs

Bengaluru, Mumbai (All Areas)

Hybrid

Dear Candidate, Please find below Job Description Azure SRE Lead Lead and mentor the team, foster SRE mindset & culture. Define SRE SLO, SLI & runbook. Recommended corrective actions and solutions for auto-healing. Regards Divya Grover +91 8448403677

Posted 1 month ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies