Observability Engineer

5 - 10 years

9 - 13 Lacs

Posted:16 hours ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Your Primary Responsibilities:

  • Design and implement observability solutions across distributed systems using Grafana, Splunk ITSI, and Dynatrace.
  • Develop and maintain custom dashboards and visualizations tailored to business and operational needs.
  • Integrate observability tools with various data sources (e.g., Prometheus, CloudWatch, Service Now, Snowflake).
  • Collaborate with application and infrastructure teams to define SLIs/SLOs and improve system reliability.
  • Troubleshoot and resolve issues related to monitoring gaps, alert noise, and data ingestion.
  • Participate in tool rationalization efforts and contribute to proof-of-concept initiatives for new observability capabilities.
  • Support automation initiatives including agent provisioning and configuration across Linux and Windows environments.
  • Contribute to the development of self-healing and anomaly detection frameworks using Splunk ITSI and Dynatrace AI capabilities.

Qualifications:

  • Minimum of 05+ years of related experience
  • Bachelor's degree preferred or equivalent experience

Talents Needed for Success:

  • 5+ years of experience in observability, monitoring, or site reliability engineering.
  • Hands-on experience with Grafana, Splunk (including ITSI), and Dynatrace.
  • Strong understanding of telemetry data types and observability architecture.
  • Experience with scripting (Python, Bash, PowerShell) and automation tools.
  • Familiarity with cloud platforms (AWS and Azure) and containerized environments (Kubernetes).
  • Excellent problem-solving skills and ability to work in a fast-paced, collaborative environment.
  • Strong communication skills.
  • Working knowledge in Open Telemetry.

Preferred Qualifications:

  • Experience with integrating observability tools into CI/CD pipelines.
  • Knowledge of ITSM tools like ServiceNow and incident response platforms like PagerDuty.
  • Exposure to AIOps, anomaly detection, and predictive analytics use cases.

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Dtcc logo
Dtcc

Financial Services

Jersey City NJ

RecommendedJobs for You

hyderabad, telangana, india