Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
7.0 - 12.0 years
15 - 30 Lacs
Hyderabad
Work from Office
Site Reliability Engineer Required Technical Skill Set-: • Practical experience with Monitoring tools, such as: Grafana, Azure Monitor, Log Analytics, Network Monitoring and Alerting Tools (i.e. Big Panda). • Experience with Automation Tooling, such as: Azure Open AI, Amelia Automation, Service Now Orchestration, Power Apps / Power Platform, Python and PowerShell. Good foundational understanding of Agile Methodologies, AI/ML for automating operational initiatives and ITIL / Change Management processes. • Knowledge of core Azure Cloud computing concepts (AZ-900 Certification as a minimum requirement, with AZ-104 certification preferred). • Knowledge of Azure Chaos Studio for Chaos Engineering Minimum 5 mandate details are mandate with two or 3 liners 1. Implementing proactive remediation automation based on past issues / incidents and hypothesising use cases where an issue / incident may, thereby, automatically restoring stability, should the incident occur. 2. Track record of implementing Monitoring tooling to encompass: Health state of Infrastructure, Network, Log & Events, Performance, Capacity and Synthetic monitoring. 3. Experience in Data Correlation & Analysis and Configuring Alerts for detected issues / incidents. 4. Knowledge of Azure Open AI, and how various data sources can be integrated with the AI for data analysis, in order to initiate events based on informed decision making. 5. Experience in leading Blameless Post-Mortems, following production incidents / outages, in order to identify opportunities for improvement
Posted 1 month ago
8.0 - 10.0 years
27 - 42 Lacs
Hyderabad
Work from Office
Job Summary Lead deliver and monitor key improvements to the currently installed systems and infrastructure. Work with product owners architects and others to implement world-class solutions that meet regulatory and customer needs. Drive improvements and upgrades to the environment from conception through to implementation Responsibilities Install configure test and maintain operating systems application software and system management tools Provide an advanced level of support to the existing environment Identify prioritize and execute tasks in the software development life cycle (SDLC) Proactively ensure the highest levels of systems and infrastructure availability Maintain security backup and redundancy strategies Maintain CI/CD pipelines to automate routine build and testing activities Write and maintain custom scripts to increase system efficiency and lower human intervention time on any tasks Work with an issue/problem management system to ensure services are provided according to relevant SLA(s) Participate in the backlog grooming and sprint planning sessions analysing requirements providing complexity estimates and proposing low-level implementation plans. Collaborate with a global group of internal teams that span Asia Europe and Americas. Ensure software is up-to-date with latest technologies and standards Assist front-line support teams in resolving customer and production issues. Escalate risks and issues and provide status reports for management. Write and maintain appropriate documentation for manual and automated processes. Understand existing complex environments and be able to easily identify problem areas and undertake successful implementations to resolve and/or mitigate. Perform occasional weekend work (e.g. patching upgrades VM migrations) Minimum 5 years experience as Devops/Site Reliability Engineer Working experience in installing configuring and troubleshooting Windows and Linux environments Experience with Automation CI/CD Gitlab Ansible and Terraform Scripting skills and (Bash Python) Working experience in setting up SLI/SLO/SLA for any new services in the monitoring systems Experience with monitoring systems Genios Big Panda Datadog) Experience with virtualization and containerization (VMware Docker Kubernetes EKS) Fair understanding of Agile methodology Experience in AWS Azure Windows eco system is plus Good working knowledge Jira Confluence and SNOW tools
Posted 2 months ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
Accenture
39581 Jobs | Dublin
Wipro
19070 Jobs | Bengaluru
Accenture in India
14409 Jobs | Dublin 2
EY
14248 Jobs | London
Uplers
10536 Jobs | Ahmedabad
Amazon
10262 Jobs | Seattle,WA
IBM
9120 Jobs | Armonk
Oracle
8925 Jobs | Redwood City
Capgemini
7500 Jobs | Paris,France
Virtusa
7132 Jobs | Southborough