4 Reliability Metrics Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

5.0 - 9.0 years

0 Lacs

panna, madhya pradesh

On-site

As a highly experienced Senior Solution Monitoring Tool Architect specializing in Monitoring Tools and AI-driven Observability, your role is pivotal in designing, implementing, and evolving enterprise-grade monitoring architectures that leverage AI/ML for predictive insights, automated incident response, and real-time analytics across hybrid and cloud-native environments. Key Responsibilities: - Design and implement scalable, resilient monitoring architectures using platforms like Prometheus, Grafana, ELK, Datadog, or Dynatrace. - Architect AI-enhanced observability solutions using ML models for anomaly detection, root cause analysis, and predictive maintenance. - Define and implement SLOs/S...

Posted 5 days ago

AI Match Score
Apply

10.0 - 14.0 years

0 Lacs

pune, maharashtra

On-site

As a member of the Center of Excellence (COE) team in the Allied Industries vertical at Infinite Uptime Process Division, you will be instrumental in driving manufacturing excellence and process improvement in heavy process industries such as chemicals, food & beverage, and allied sectors. Your main responsibilities will include: - Analyzing plant data to identify deviations in key parameters like Performance, Golden batch adherence, yield, quality, uptime, and energy efficiency. - Recommending corrective actions and driving measurable improvements using our ODR (Outcome-Driven Reliability) framework. - Collaborating with cross-functional teams to turn insights into results for our customers...

Posted 1 week ago

AI Match Score
Apply

3.0 - 8.0 years

0 Lacs

jaipur, rajasthan

On-site

As a Senior Site Reliability Engineer (SRE) at Datadog Observability, you will be responsible for leading end-to-end SRE implementation initiatives with a strong focus on Datadog. Your role will involve designing, configuring, and managing Datadog dashboards, monitors, alerts, and APM for proactive issue detection and resolution. You will collaborate with various teams to identify observability gaps and implement automation for alerting, incident response, and ticket creation to improve operational efficiency. Additionally, you will provide technical leadership in observability, reliability, and performance engineering practices. Key Responsibilities: - Drive end-to-end SRE implementation, e...

Posted 3 weeks ago

AI Match Score
Apply

5.0 - 9.0 years

0 Lacs

thane, maharashtra

On-site

As a Reliability Engineer at our company, you will be responsible for developing and implementing reliability engineering strategies to ensure the reliability, availability, and maintainability of our Thermal Products. You will collaborate with a cross-functional team, including product, design, manufacturing, service, and quality, to ensure that reliability engineering principles are incorporated at the product design stage. Key responsibilities include: - Developing and managing reliability engineering processes such as failure mode analysis (FMEA), Fault Tree Analysis (FTA), and Reliability Block Diagram (RBD) analysis. - Analyzing and resolving reliability issues, such as conducting Root...

Posted 1 month ago

AI Match Score
Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies