3 - 6 years

8 - 18 Lacs

Posted:Just now| Platform: Naukri logo

Apply

Work Mode

Hybrid

Job Type

Full Time

Job Description

Roles and Responsibilities

Service Reliability & Operations

  • Ensure high availability and optimal performance of key systems through active monitoring and proactive issue resolution.
  • Work with

    Service Level Indicators (SLIs)

    and

    Service Level Objectives (SLOs)

    to achieve operational goals.
  • Identify recurring issues and drive root-cause fixes to improve service stability.

Incident & Problem Management

  • Participate in incident response to minimize downtime and business impact.
  • Perform root cause analysis and document post-incident reports.
  • Collaborate with engineering teams to develop long-term solutions for known issues.
  • Troubleshoot and debug

    Java

    and

    microservices-based

    applications.

Support & Escalation Handling

  • Handle support tickets and technical escalations efficiently.
  • Follow through on open issues until resolution, ensuring minimal recurrence.
  • Maintain and enhance support documentation,

    runbooks

    , and

    SOPs

    .
  • Coordinate with cross-functional teams during incident resolution.

Monitoring & Automation

  • Use tools like

    Prometheus

    ,

    Grafana

    , or

    New Relic

    for system monitoring and alerting.
  • Contribute to automation initiatives and develop scripts to improve operational efficiency.
  • Maintain documentation for monitoring, support procedures, and operational practices.

Collaboration & Communication

  • Partner with development, infrastructure, and support teams to ensure seamless operations.
  • Provide timely updates on incident status and overall service health.
  • Maintain clear and transparent communication across all stakeholders during incidents.

Required Skills

  • 35 years of experience in

    SRE

    ,

    IT Operations

    , or

    Technical Support

    roles.
  • Strong understanding of

    Java

    and

    microservices architecture

    .
  • Experience with monitoring tools (Prometheus, Grafana, New Relic).
  • Hands-on experience with

    Kubernetes

    and cloud platforms (

    AWS

    ,

    Azure

    ,

    GCP

    ).
  • Familiarity with

    ITIL-based

    support practices and service management.
  • Excellent troubleshooting, analytical, and communication skills.

Preferred Skills

  • Experience with ITSM/ticketing tools such as

    ServiceNow

    or

    Jira Service Management

    .
  • Knowledge of automation or scripting (

    Bash, Python

    , etc.).
  • Awareness of

    security

    and

    compliance

    best practices in operations.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
LANDMARK GROUP logo
LANDMARK GROUP

Retail

Dubai Riyadh

RecommendedJobs for You

noida, uttar pradesh, india

noida, hyderabad, gurugram

chennai, tamil nadu, india