SRE Observability Lead

13 - 17 years

0 Lacs

Posted:2 weeks ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview: As the SRE Observability Lead Engineer at Services Technology, you will be a hands-on leader responsible for shaping and delivering the future of Observability. Reporting to the Head of SRE Services, you will define the long-term vision, build and scale modern observability capabilities across business lines, and lead a small team of SREs delivering reusable observability services. This role is a blend of leadership and engineering, requiring both strategic vision and technical depth to resolve telemetry challenges across various environments. Key Responsibilities: - Define and own the strategic vision and multi-year roadmap for Observability across Services Technology, in alignment with enterprise reliability and production goals. - Translate strategy into an actionable delivery plan in collaboration with Services Architecture & Engineering function, delivering incremental, high-value milestones towards a unified, scalable observability architecture. - Lead and mentor SREs across Services, fostering technical growth and an SRE mindset. - Build and offer a suite of central observability services across business lines, including standardized telemetry libraries, onboarding templates, dashboard packs, and alerting standards. - Drive reusability and efficiency by creating common patterns and golden paths for observability adoption across critical client flows and platforms. - Work hands-on to troubleshoot telemetry and instrumentation issues across on-prem, cloud, and container-based environments. - Collaborate closely with the architecture function to support implementation of observability non-functional requirements (NFRs) in the SDLC, ensuring new apps go live with sufficient coverage and insight. - Support SRE Communities of Practice (CoP) and foster strong relationships with SREs, developers, and platform leads to accelerate adoption and promote SRE best practices. - Use Jira/Agile workflows to track and report on observability maturity across Services business lines, including coverage, adoption, and contribution to improved client experience. - Influence and align senior stakeholders across functions to drive observability investment for critical client flows. - Lead people management responsibilities for your direct team, including management of headcount, goal setting, performance evaluation, compensation, and hiring. Qualifications: - 13+ years of experience in Observability, SRE, Infrastructure Engineering, or Platform Architecture, with several years in senior leadership roles. - Deep expertise in observability tools and stacks such as Grafana, Prometheus, OpenTelemetry, ELK, Splunk, and similar platforms. - Strong hands-on experience across hybrid infrastructure, including on-prem, cloud, and container platforms. - Proven ability to design scalable telemetry and instrumentation strategies, resolve production observability gaps, and integrate them into large-scale systems. - Experience leading teams and managing people across geographically distributed locations. - Strong ability to influence platform, cloud, and engineering leaders to ensure observability tooling is built for reuse and scale. - Deep understanding of SRE fundamentals, including SLIs, SLOs, error budgets, and telemetry-driven operations. - Strong collaboration skills and experience working across federated teams, building consensus and delivering change. - Ability to stay up to date with industry trends and apply them to improve internal tooling and design decisions. - Excellent written and verbal communication skills; able to influence and articulate complex concepts to technical and non-technical audiences. (Note: Additional details about the company were not provided in the job description.),

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You