Manager Observability

5 - 9 years

0 Lacs

Posted:2 days ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview: As a Senior Manager of Observability at Ensono, you will be responsible for leading the observability strategy to improve and maintain the health and performance of systems, applications, and infrastructure. Your role will involve developing and refining the observability framework, managing a team of engineers, overseeing incident management, tool selection and integration, collaboration with cross-functional teams, promoting data-driven decisions, ensuring automation and scalability, providing reporting and insights to leadership. Key Responsibilities: - Lead Observability Strategy: Develop, implement, and refine the companys observability framework to ensure visibility into all production systems and services. - Team Leadership: Manage and mentor a team of engineers in building scalable observability solutions and fostering continuous learning. - Incident Management: Oversee alerting and incident response processes for rapid issue identification, diagnosis, and resolution. - Tool Selection and Integration: Evaluate and integrate observability tools like Prometheus, Grafana, Datadog, etc., aligning them with business goals. - Collaboration: Work with engineering, DevOps, and IT teams to define KPIs and ensure visibility into system reliability and performance. - Data-Driven Decisions: Leverage observability data for proactive system optimization and feature improvements. - Automation and Scalability: Ensure observability solutions scale with organizational needs, automate processes, and optimize resource usage. - Reporting and Insights: Provide regular reports to leadership on system health, performance trends, and incident analysis for improvements. Qualifications: - Bachelors or Masters degree in Computer Science, Engineering, or related field. - 8+ years of experience in a technical leadership role, with at least 5 years in an observability-focused position. - Expertise in observability tools like Datadog, Splunk, New Relic, etc. - Good understanding of distributed systems, microservices architecture, and cloud environments. - Hands-on experience with monitoring, logging, and tracing systems, as well as building and optimizing dashboards. - Experience with incident management, on-call rotation processes, and coding/scripting for automating tasks. - Strong leadership skills, excellent communication, and the ability to convey technical concepts clearly. Preferred Qualifications: - Experience with containerization and container orchestration platforms. - Familiarity with CI/CD pipelines and DevOps practices. - Experience with chaos engineering and resiliency testing. - Certifications in cloud platforms like AWS Certified Solutions Architect, Google Cloud Professional Cloud Architect.,

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Ensono logo
Ensono

IT Services and IT Consulting

Downers Grove Illinois

RecommendedJobs for You

Pune, Maharashtra, India

Pune, Maharashtra, India