Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in pune
>
Siemens
>
Site Reliability Engineer (SRE) - Incident Commander

Site Reliability Engineer (SRE) - Incident Commander

Siemens

3 - 8 years

20 - 25 Lacs

pune

Posted:1 month ago| Platform:

Apply

Skills Required

kubernetes docker gcp troubleshooting aws python cyber security service management arcsight soc information security microsoft azure siem incident response qradar incident management threat hunting splunk security operations center jira

Work Mode

Work from Office

Job Type

Full Time

Job Description

The DISW SRE organization is dedicated to enhancing service and application availability, optimizing processes by automating manual and repetitive tasks, and addressing complex technical challenges in a dynamic, collaborative, inclusive, and iterative environment. This position plays a crucial role in developing automated solutions and processes that support and sustain best-in-class cloud-based applications.

The candidate will support the Siemens Xcelerator platform and will be for coordinating major incident response, maintaining partner communication during service-impacting events, and facilitating resolution in compliance with service level agreement (SLA). Strong communication & coordination skills are necessary to support core objectives. This roles success will be defined by product teams within DISW business units meeting their SLAs.

Key Responsibilities

Incident Management: Act as the primary point of contact and leader during major incidents, coordinating the response, communication, and resolution efforts across all involved teams.
Incident Response: Quickly assess the severity of incidents, determine the impact, and drive the appropriate response to restore services as quickly as possible.
Communication: Ensure clear, concise, and timely communication with stakeholders, including technical teams, management, and customers, throughout the incident lifecycle.
Post-Incident Analysis: Lead post-incident reviews to identify root causes, drive improvements, and implement preventive measures to reduce the likelihood of recurrence.
Collaboration: Work closely with SRE, DevOps, Development, and other relevant teams to ensure that incident management processes are well-defined and continuously improved.
Training & Preparedness: Conduct regular incident response drills, train teams on incident management processes, and ensure readiness for handling high-severity incidents.
Documentation: Maintain and update incident management documentation, ensuring that all procedures are up-to-date and accessible to all relevant teams.
Monitoring & Alerts: Collaborate with SRE and monitoring teams to define and refine alerting criteria, ensuring that incidents are detected and escalated promptly.
Continuous Improvement: Find opportunities to improve system reliability, scalability, and performance based on lessons learned from incidents.
24x7 On-call rotation: Participate in 24x7 on-call rotation.

Qualifications:

Technical Skills
: Familiar with cloud infrastructure (AWS, GCP, Azure), containerization (Docker, Kubernetes)
Certifications: Relevant certifications (e.g., AWS Certified Solutions Architect, Certified Kubernetes Administrator) are a plus.
Automation: Experience with automation tools and scripting languages (e.g., Python, Bash) to streamline incident response and remediation.
Stakeholder Management: Experience aligning with cross-functional teams including business and product stakeholders during and after incidents.
Metrics Ownership: Ability to define and track incident-related critical metrics (e.g., MTTR, MTTD) to drive accountability and improvement.
Experience: Enterprise IT environment with distributed environments
Communication: Outstanding English communication skills, both verbal and written, as well as, listening and synthesis skills.
Incident Response: Quickly assess the severity of incidents, determine the impact, and drive the appropriate response to restore services as quickly as possible.
Problem-Solving: Excellent troubleshooting and problem-solving skills, with the ability to quickly analyze complex systems.
Calm Under Pressure: Ability to remain calm, focused, and effective in high-pressure situations. The ability to make quick, confident decisions.
Leadership: Demonstrated experience in leading incident response efforts and managing cross-functional teams during critical situations.
Technical Skills: Familiar with Jira Service management (or equivalent i.e. ServiceNow), Datadog (or equivalent i.e. Grafana), PagerDuty (or equivalent), Atlassian Status page (or equivalent).
Driven Learner: Highly motivated and driven to learn new technologies, skills, and methodologies, continuously seeking to expand your knowledge and adapt to evolving industry trends.
Must be willing and available to work the core hours required

More Jobs at Siemens

Process Expert - Opportunity to Cash

Bengaluru, Karnataka

Experience: Not specified

Salary: Not disclosed

Process Expert - Site Installation

Bengaluru, Karnataka

Experience: Not specified

Salary: Not disclosed

Process Expert - Testing and Commissioning

Bengaluru, Karnataka

Experience: Not specified

Salary: Not disclosed

Business Administration - Proj Exc Commercial

Bengaluru, Karnataka

Experience: Not specified

Salary: Not disclosed

EHS Country Head

Mumbai, Maharashtra, India

5 - 5 yrs

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Siemens

Automation Machinery Manufacturing

Munich Brande

Login to

Please Verify Your Phone or Email

Confirm Action

Site Reliability Engineer (SRE) - Incident Commander