Site Reliability Engineer

8 - 13 years

22 - 25 Lacs

Posted:7 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Role & responsibilities

Job Summary:

SRE Observability Engineer

Key Responsibilities:

  • Build and maintain unified dashboards

    for real-time monitoring of:
    • Production support health (Derived from ServiceNow or similar ITSM platforms).
    • Application performance and availability (Using tools like AppDynamics, Dynatrace, etc.).
  • Integrate logs and observability data

    with ITSM tools to streamline incident management and reporting.
  • Collaborate with support and development teams

    to identify key metrics and KPIs for monitoring application and infrastructure health.
  • Identify automation opportunities

    to improve operational efficiency and reduce manual effort in monitoring and incident response.
  • Develop and implement self-healing solutions

    using scripting languages such as Python or Java to automatically resolve recurring incidents.
  • Continuously enhance monitoring strategy

    to support scalability, reliability, and high availability of production systems.
  • Drive observability best practices

    across teams and support the adoption of modern monitoring, alerting, and logging standards.

Required Skills & Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • Proven experience (3+ years) in SRE, Observability, or Production Support roles.
  • Hands-on experience with observability platforms such as:
    • AppDynamics, Dynatrace, New Relic, Prometheus, Grafana, etc.
  • Proficiency in scripting languages such as

    Python

    and

    Java

    for automation and self-healing.
  • Strong experience in building

    custom dashboards

    and

    alerting rules

    based on production metrics.
  • Experience with

    ITSM tools

    like ServiceNow, and integration of monitoring/log data into ITSM workflows.
  • Solid understanding of monitoring concepts including

    SLAs, SLOs, and SLIs

    .
  • Familiarity with log management tools (e.g., ELK, Splunk) is a plus.
  • Excellent problem-solving skills and ability to work under pressure in a production environment.

Preferred Qualifications:

  • Knowledge of containerization and orchestration tools like Docker and Kubernetes.
  • Experience with CI/CD pipelines and DevOps practices.
  • Exposure to cloud environments (AWS, Azure, GCP) and their monitoring services.

Mock Interview

Practice Video Interview with JobPe AI

Start Java Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Java Skills

Practice Java coding challenges to boost your skills

Start Practicing Java Now
Techno Facts Solutions logo
Techno Facts Solutions

Information Technology Consulting

Tech City

RecommendedJobs for You