Manager Observability

8 years

0 Lacs

Posted:6 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

About Role We are seeking a highly skilled and experienced Senior Manager of Observability to lead our observability strategy, helping to improve and maintain the health and performance of our systems, applications, and infrastructure. The ideal candidate will have a strong background in systems engineering, observability platforms, and a deep understanding of how to collect, analyze, and interpret system data to drive proactive performance monitoring, issue resolution, and scalability improvements. This is a leadership role that requires technical expertise, collaboration, and the ability to work across teams to ensure a seamless and efficient observability experience. Key Responsibilities: - Lead Observability Strategy: Develop, implement, and refine the company's observability framework, ensuring robust visibility into all production systems and services, including logging, metrics, traces, and alerts. - Team Leadership: Manage and mentor a team of engineers, guiding them in building scalable observability solutions and fostering a culture of continuous learning and improvement. - Incident Management: Oversee the design and implementation of alerting and incident response processes, ensuring rapid identification, diagnosis, and resolution of system issues and downtime. - Tool Selection and Integration: Evaluate, select, and integrate observability tools, ensuring they align with business goals and infrastructure requirements. Oversee platform configurations, custom metrics, and data pipelines for observability tools like Prometheus, Grafana, Datadog, etc. - Collaboration: Work closely with engineering, DevOps, and IT teams to define key performance indicators (KPIs) and ensure visibility into the reliability and performance of systems. - Data-Driven Decisions: Promote a data-driven approach to decision-making, leveraging observability data to drive proactive system optimization and feature improvements. - Automation and Scalability: Ensure that observability solutions scale with the growing needs of the organization, automating processes and optimizing resource usage. - Reporting and Insights: Provide regular reports to leadership on system health, performance trends, and incident root cause analysis, offering insights on improvements and business impact. Qualifications: - Bachelor's or Master's degree in Computer Science, Engineering, or related field (or equivalent experience). - 8+ years of experience in a technical leadership role, with at least 5 years in an observability-focused position. - Expertise in observability tools (e.g., BMC True sight/Helix, Entuity, Datadog, Splunk, New Relic, etc.). - Good understanding of distributed systems, microservices architecture, and cloud environments (AWS, Azure, GCP). - Hands-on experience with monitoring, logging, and tracing systems, as well as experience building and optimizing dashboards, alerting, and reporting frameworks. - Experience with incident management and on-call rotation processes. - Proficient in coding and scripting (Python, Go, Bash, etc.) for automating observability tasks and data pipelines. - Strong leadership skills with a track record of managing and mentoring high-performing teams. - Excellent communication skills, with the ability to clearly convey complex technical concepts to non-technical stakeholders. Preferred Qualifications: - Experience with containerization (Docker, Kubernetes) and container orchestration platforms. - Familiarity with CI/CD pipelines and DevOps best practices. - Experience with chaos engineering and resiliency testing. - Certifications in cloud platforms (AWS Certified Solutions Architect, Google Cloud Professional Cloud Architect, etc.). Show more Show less

Mock Interview

Practice Video Interview with JobPe AI

Start Strategy Interview Now

My Connections Ensono

Download Chrome Extension (See your connection in the Ensono )

chrome image
Download Now
Ensono
Ensono

IT Services and IT Consulting

Downers Grove Illinois

1001-5000 Employees

51 Jobs

    Key People

  • Jeffrey W. Wurst

    Chief Executive Officer
  • Norman H. Dyer

    Chief Financial Officer

RecommendedJobs for You

Pune, Maharashtra, India

Pune, Maharashtra, India