Monitoring & Observability Engineer – Datadog Specialist 4 + yrs

4 years

0 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

Remote

Job Type

Full Time

Job Description

Job Title: Monitoring & Observability Engineer – Datadog Specialist

Experience: 4+ Years

Location: [Specify Location or Remote]

Job Type: Full-Time


Job Summary:

We are looking for a talented Observability Engineer with hands-on experience in Datadog to enhance our infrastructure and application monitoring capabilities. The ideal candidate will have a strong understanding of performance monitoring, alerting, and observability in cloud-native environments.


Key Responsibilities:

Design, implement, and maintain observability solutions using Datadog for applications, infrastructure, and cloud services.

Set up dashboards, monitors, and alerts to proactively detect and resolve system issues.

Collaborate with DevOps, SRE, and application teams to define SLOs, SLIs, and KPIs for performance monitoring.

Integrate Datadog with services such as AWS, Kubernetes, CI/CD pipelines, and logging tools.

Conduct performance tuning and root cause analysis of production incidents.

Automate observability processes using infrastructure-as-code and scripting (e.g., Terraform, Python).

Stay up-to-date with the latest features and best practices in Datadog and observability space.


Must-Have Skills:

4+ years of experience in monitoring/observability, with 2+ years hands-on experience in Datadog

Strong experience with Datadog APM, infrastructure monitoring, custom metrics, and dashboards

Familiarity with cloud platforms like AWS, GCP, or Azure

Experience monitoring Kubernetes, containers, and microservices

Good knowledge of log management, tracing, and alert tuning

Proficient with scripting (Python, Shell) and IaC tools (Terraform preferred)

Solid understanding of DevOps/SRE practices and incident management


Nice-to-Have Skills:

Datadog certifications (e.g., Datadog Certified Observability Engineer)

Experience integrating Datadog with CI/CD tools, ticketing systems, and chatops

Familiarity with other monitoring tools (e.g., Prometheus, Grafana, New Relic, Splunk)

Knowledge of performance testing tools (e.g., JMeter, k6)


Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now