Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
5.0 - 8.0 years
7 - 15 Lacs
hyderabad
Work from Office
Role Purpose The purpose of this role is to lead DevOps team to facilitate better coordination among operations, development and testing functions by automating and streamlining the integration and deployment processes Do Drive technical solution support to the team to align on continuous integration (CI) and continuous deployment (CD) of technology in applications Design and define the overall DevOps architecture/ framework to for a project/ module delivery as per the client requirement Decide on the DevOps tool & platform and which needs to be deployed aligned to the customers requirement Create a tool deployment model for validating, testing and monitoring performance and align or provision for resources accordingly Define & manage the IT infrastructure as per the requirement of the supported software code Manage and drive the DevOps pipeline that supports the application life cycle across the DevOps toolchain from planning, coding and building, to testing, to staging, to release, configuration and monitoring Work with the team to tackle the coding and scripting needed to connect elements of the code that are required to run the software release with operating systems and production infrastructure with minimum disruptions Ensure on boarding application configuration from planning to release stage Integrate security in the entire dev-ops lifecycle to ensure no cyber risk and data privacy is maintained Provide customer support/ service on the DevOps tools Timely support internal & external customers escalations on multiple platforms Troubleshoot the various problems that arise in implementation of DevOps tools across the project/ module Perform root cause analysis of major incidents/ critical issues which may hamper project timeliness, quality or cost Develop alternate plans/ solutions to be implemented as per root cause analysis of critical problems Follow escalation matrix/ process as soon as a resolution gets complicated or isnt resolved Provide knowledge transfer, sharing best practices with the team and motivate Team Management Resourcing Forecast talent requirements as per the current and future business needs Hire adequate and right resources for the team Train direct reportees to make right recruitment and selection decisions Talent Management Ensure 100% compliance to Wipros standards of adequate onboarding and training for team members to enhance capability & effectiveness Build an internal talent pool of HiPos and ensure their career progression within the organization Promote diversity in leadership positions Performance Management Set goals for direct reportees, conduct timely performance reviews and appraisals, and give constructive feedback to direct reports. Incase of performance issues, take necessary action with zero tolerance for will based performance issues Ensure that organizational programs like Performance Nxtarewell understood and that the team is taking the opportunities presented by such programs to their and their levels below Employee Satisfaction and Engagement Lead and drive engagement initiatives for the team Track team satisfaction scores and identify initiatives to build engagement within the team Proactively challenge the team with larger and enriching projects/ initiatives for the organization or team Exercise employee recognition and appreciation Mandatory Skills: Docker.Experience: 5-8 Years.
Posted Date not available
6.0 - 11.0 years
22 - 37 Lacs
gurugram, delhi / ncr
Hybrid
The SRE team at GreyOrange is responsible for monitoring the stability and availability of mission-critical production systems, managing incidents for quicker resolution, and establishing BAU. The team also manages and maintains internal tools/infra which is consumed by other development teams. The experienced SRE will play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applications. The ideal candidate will have a strong background in software engineering, system administration, containerization, and cloud technologies. Requirements Should have 6 to 11 years of experience Well-versed with scripting/programming languages (Python/Bash/PowerShell, etc.) to automate manual work, particularly within cloud environments Well-versed with Observability tools (Grafana, Splunk, Dynatrace) for monitoring, alerting, and logging solutions to identify and address potential issues, especially in cloud infrastructure Working experience with automation tools (Jenkins, GitLab, Ansible/Chef for configuration management) and processes to streamline deployment, monitoring, and management of systems and applications in the cloud Hands-on experience with containerization and orchestration technologies such as Docker, Kubernetes, or similar, particularly in cloud-native environments Well aware of SLI, SLO, SLA, and Error Budget concepts and their implementations; provide on-call support and participate in incident management & response activities as needed Expert with troubleshooting production issues and bugs. Good knowledge of Unix systems, networking, web technologies, and databases. Incident Management experience coupled with effective communication skills for production workload. Working knowledge in any one of the cloud platforms (AWS or GCP) What you'll do? Lead reliability engineering projects and drive them to closure. Ensure system stability and high availability by proactively monitoring performance and troubleshooting issues Design, build and maintain efficient, reliable, and scalable cloud-based infrastructure and services Automate processes and find opportunities to improve the observability and availability of the Platform to reduce toil. Implement and manage observability tools for comprehensive monitoring, alerting, and logging Own end-to-end availability and performance of different services & tools. Practice sustainable incident response and blameless postmortems. Provide on-call support for incident management and participate actively in response activities
Posted Date not available
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
73564 Jobs | Dublin
Wipro
27625 Jobs | Bengaluru
Accenture in India
22690 Jobs | Dublin 2
EY
20638 Jobs | London
Uplers
15021 Jobs | Ahmedabad
Bajaj Finserv
14304 Jobs |
IBM
14148 Jobs | Armonk
Accenture services Pvt Ltd
13138 Jobs |
Capgemini
12942 Jobs | Paris,France
Amazon.com
12683 Jobs |