Lead Monitoring Engineer - BHOM

8 - 13 years

10 - 14 Lacs

Posted: 2 weeks ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

We are looking for a Lead Monitoring Engineer to take ownership of our enterprise-wide
monitoring and observability platforms. In this role, you'll lead the design, implementation, and
optimization of monitoring solutions using BMC Helix Operations Management (BHOM) and
Prometheus across a large-scale, global SaaS environment.

You'll work at the intersection of infrastructure, DevOps, and operations, driving initiatives that
improve visibility, reliability, and performance for our critical business systems. This is an
opportunity to shape how monitoring and AIOps are implemented across the organization.

Here is how, through this exciting role, YOU will contribute to BMC's and your own success:

Monitoring Strategy & Architecture

  • Define and execute a comprehensive monitoring and observability strategy for hybrid and cloud environments.
  • Design and deploy scalable monitoring frameworks using BHOM and Prometheus.
  • Enhance observability through AIOps, enabling proactive detection and resolution of system anomalies.

System Administration & Optimization

  • Manage and maintain monitoring tools for high availability and performance.
  • Establish intelligent alerting and visualization practices to minimize false positives.
  • Continuously evolve monitoring coverage, dashboards, and data integrations.

Leadership & Collaboration

  • Lead and mentor a team of monitoring engineers and administrators.
  • Partner closely with DevOps, SRE, and application teams to embed observability best practices into CI/CD pipelines.
  • Drive automation and standardization for monitoring configurations and operational workflows.

Reporting & Insights

  • Build executive-level dashboards and reports highlighting system health, performance trends, and key reliability metrics.
  • Maintain clear documentation and knowledge repositories for all monitoring configurations and procedures.

To ensure you're set up for success, you will bring the following skillset & experience:

Technical Expertise

  • Strong hands-on experience with BMC Helix Operations Management (BHOM) and Prometheus.
  • Solid understanding of monitoring principles, event correlation, and metric-based alerting.
  • Proficiency in Python or other scripting languages for automation.
  • Expertise in PromQL for custom metric analysis and dashboarding.
  • Familiarity with log analytics tools (e.g., Kibana, ClickHouse) and relational databases.
  • Knowledge of cloud platforms (AWS, OCI, GCP) and containerized environments (Kubernetes, Docker).
  • Experience with network performance monitoring (NPM) tools is a plus.

Experience & Soft Skills

  • 8+ years of experience in monitoring, observability, or infrastructure operations.
  • Proven experience leading technical teams and driving cross-functional initiatives.
  • Strong analytical mindset and problem-solving skills.
  • Excellent written and verbal communication, with the ability to explain complex ideas clearly to technical and non-technical audiences.

Preferred Certifications

  • BMC Certified Professional, Certified Kubernetes Administrator (CKA), or equivalent certifications preferred.

BMC Software

IT Services and IT Consulting

Houston, Texas
