Monitoring and Observability Engineer (Prometheus & Grafana Specialist)

3 - 5 years

0 Lacs

Posted:1 week ago| Platform: Foundit logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Position:

Experience:

Location:

Job Type:

Send your cv [HIDDEN TEXT]

About the Role

Monitoring and Observability Engineer

Key Responsibilities

  • Design, implement, and maintain robust

    monitoring and alerting

    solutions using

    Prometheus

    and

    Grafana

    for mission-critical systems.
  • Write and optimize

    PromQL

    queries for efficient data retrieval and analysis.
  • Create

    highly customized Grafana dashboards

    for large, complex datasets with a focus on performance, readability, and actionable insights.
  • Develop and maintain

    custom Grafana plugins

    (data source, panel, app) using

    JavaScript, TypeScript, React, and Go

    .
  • Integrate Prometheus and Grafana with various

    data sources

    (databases, cloud services, APIs, log aggregation tools such as Loki or ELK).
  • Configure and manage

    Alertmanager

    for alert routing, notifications, and escalations.
  • Troubleshoot performance, data collection, and visualization issues.
  • Collaborate with

    SRE, DevOps, and development teams

    to translate monitoring needs into effective observability solutions.
  • Implement best practices for monitoring, alerting, and scalability.
  • Automate setup and configuration using

    Terraform, Ansible

    , or similar IaC tools.
  • Keep up-to-date with emerging trends in the

    Prometheus

    and

    Grafana

    ecosystem.
  • Document configurations, dashboards, and troubleshooting processes.

Required Skills & Qualifications

  • Bachelors in

    Computer Science, IT

    , or related field.
  • 2+ years of

    hands-on production experience

    with Prometheus & Grafana.
  • Strong

    PromQL

    expertise.
  • Advanced

    Grafana dashboard customization

    for large-scale datasets.
  • Experience developing

    Grafana plugins

    using JavaScript, TypeScript, React, and/or Go.
  • Knowledge of monitoring best practices and alerting strategies.
  • Familiarity with

    Prometheus exporters

    .
  • Experience with

    Docker, Kubernetes

    , and cloud platforms (AWS, Azure, GCP).
  • Proficiency in scripting (

    Python, Bash

    ) for automation.
  • Strong troubleshooting, analytical, and communication skills.

Preferred (Good to Have)

  • Experience with

    Loki, Jaeger, OpenTelemetry

    .
  • Knowledge of distributed tracing and log management.
  • GitOps experience for monitoring configuration management.
  • Contributions to

    Prometheus

    or

    Grafana

    open-source projects.
  • Relevant Prometheus/Grafana certifications.

  • Technical & Role-Specific Hashtags

#MonitoringEngineer

#ObservabilityEngineer

#Prometheus

#Grafana

#PromQL

#GrafanaPlugins

#SREJobs

#DevOpsJobs

#MonitoringAndAlerting

#DashboardDevelopment

#WeAreHiring

#HiringNow

#JobOpening

#TechJobs

#BangaloreJobs

#ITJobs

#EngineeringJobs

#CareerOpportunity

#JoinOurTeam

#AWS

#Azure

#GCP

#Kubernetes

#Docker

#CloudComputing

#InfrastructureAsCode

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You