Production and Support Engineer

5 - 10 years

4 - 9 Lacs

Posted:1 day ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Who are we looking for?

Application Support Engineer with overall experience of 4+ years of experience in development and supporting Complex and critical large scale distributed systems and extensive hands-on experience in handling production failures & driving root cause analysis and remediation.

Primary Responsibilities:

  • Proactively manage production outages and performance issues through quality analysis and rapid resolution.
  • Lead incident management, ensuring clear communication with users, application owners, and senior stakeholders.
  • Perform root cause analysis (RCA) and document post-mortems for incidents.
  • Identify recurring patterns, conduct thorough post-mortems, and implement permanent fixes to prevent reoccurrence.
  • Collaborate with engineering teams to automate ing and recovery processes, reducing manual intervention.
  • Drive continuous improvement by minimizing manual tasks through automation and dynamic monitoring solutions.
  • Build and enhance runbooks to streamline operational workflows.
  • Work with development teams to improve application observability and enable faster MTTD (Mean Time to Detect) and MTTR (Mean Time to Resolve).
  • Identify automation opportunities in s and processes, and partner with engineering to implement them.
  • Possess strong understanding of deployment methodologies with hands-on experience in production deployments.
  • Skilled in instrumentation, monitoring, ing, and incident response using tools like AppDynamics, Splunk, ThousandEyes, and ITRS.

Technical Skills:

  • Over 5 years of IT experience, including 3+ years in L2 application support.
  • Proficient in managing large-scale production systems involving load balancing, distributed systems, microservices, monitoring, and configuration management.
  • Strong hands-on experience in troubleshooting:
    • Application failures and performance degradation
    • Code-level issues and cloud platform incidents
    • Batch, infrastructure, database, and network failures
  • Skilled in ITIL practices: Event, Incident, Release, Problem, and Knowledge Management.
  • Experienced in production deployments using CI/CD tools.
  • Solid understanding of SLIs, SLOs, and handling burn rate s.
  • Working knowledge of SQL and NoSQL databases.
  • Proficient in monitoring and ing tools like AppDynamics, Splunk, ThousandEyes, and ITRS.
  • Expertise in cloud technologies, with a preference for PCF (Pivotal Cloud Foundry).
  • Expertise in Scripting technologies like Shell, Powershell, Python.

Mandatory skills : Solid Production Support (application + batch-Control-M) & Troubleshooting + Splunk + Linux/Windows Admin(Moderate) + Production Deployment Experince + Scripting(Moderate) + good to have -DevOps & Cloud + Other APM Tools

  • Immediate joiners only
  • Ready to work from office
  • Ready to work in 24*7
  • Living within the city premises where office cabs operate.
  • Work from Mphasis Office - ODC
  • Work Location Hyderabad and Bangalore
  • Rotational Shift Timings (6:30 AM TO 3:30 PM // 2:00 PM TO 11:00 PM // 10:00 PM TO 6:00 AM)

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Mphasis logo
Mphasis

Information Technology and Services

Grapevine

RecommendedJobs for You