Incident Manager

8 years

0 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As Incident Manager IV, you will be the link between our Support, Engineering and Infrastructure teams. You will enable a better experience for our customers by organizing and driving the investigation of production issues in our application, which is a SaaS product consisting of Spring based microservices, ML models and data pipelines hosted within the AWS infrastructure, and report on these to Engineering, Support and other stakeholders. In doing so, you will also have a positive impact on the quality of the product. We are looking for somebody who is passionate about product quality, has extreme customer empathy, and is constantly looking to improve the quality of our services.

This is an engineering position, not a management position.


Role Value:

Your work will directly contribute to greater customer satisfaction by providing information about product issues in a timely manner. You will also help our Sales teams by answering technical questions about our infrastructure in customer RFP’s.

Key Responsibilities

  • Investigate production issues raised by customers, Support and Engineering
  • Work as a liaison between Support and Engineering to facilitate issue resolution, root cause analysis (RCA), and drive the implementation of learnings
  • Create and track progress of problem tickets in Jira
  • Create incident analysis reports with the support of Engineering teams
  • Perform log file analysis with Datadog
  • Debugging of basic REST API calls for investigations
  • Execute SQL database queries to provide more information for investigations
  • Create and update knowledge base articles in Confluence
  • Participate in security audits (PCI DSS, ISO 27001, SOC2) and preparing supporting evidence

Skills & Qualifications

Must-Have Skills:

  • Working experience of at least 8 years in IT (SRE, sysadmin, developer, QA, technical support, or similar)
  • University degree in a relevant field
  • Strong analytical, problem-solving and collaboration skills
  • Basic understanding of systems architecture of cloud hosted applications
  • Data analysis skills - creating and interpreting dashboards to distinguish between real issues and false positives
  • Project management and documentation skills such as Jira and Confluence
  • Excellent written and verbal communication skills in English
  • Knowledge of cloud, preferably AWS, infrastructure components
  • Experience with REST APIs and tools e.g. Postman
  • Experience with application logging/monitoring tools e.g. Kibana, Datadog;
  • Experience with SQL, Linux & Network environments
  • Willingness to learn new technical skills


Nice-to-Have Skills:

  • Understanding of basic ML concepts and LLM’s
  • experience with Git or similar version control system
  • experience with agile software development process
  • Jenkins or similar CI pipeline
  • Bash scripting for Linux
  • basic skills in software development e.g. Java, Python, JavaScript, Go;
  • experience with Docker & Microservices
  • network and application security
  • working within a PCI DSS environment

Mock Interview

Practice Video Interview with JobPe AI

Start Java Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Java Skills

Practice Java coding challenges to boost your skills

Start Practicing Java Now
Talentoj logo
Talentoj

Human Resources

Talent City

RecommendedJobs for You

Bengaluru, Karnataka, India

Thiruporur, Tamil Nadu, India

Thiruporur, Tamil Nadu, India

Noida, Uttar Pradesh, India