Industry Consulting Manager

15 - 20 years

13 - 17 Lacs

Posted:1 day ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

We are currently seeking a Industry Consulting Manager to join our team in Noida, Uttar Pradesh (IN-UP), India (IN).

Position Title: Technical Architect Observability & Site Reliability Engineering (SRE)

Location:

Experience:

Employment Type:

Role Overview

Technical Architect

create architecture blueprints

Key Responsibilities

  1. Architecture & Blueprinting

  • Design and deliver

    end-to-end observability architecture

    (Metrics, Logs, Traces, Events) for cloud-native and hybrid environments.
  • Create

    technical architecture diagrams

    , data flow maps, and integration blueprints using tools like Lucidchart, Draw.io, or Visio.
  • Lead the definition of

    SLIs, SLOs, and Error Budgets

    aligned with business KPIs and DORA metrics.
  1. Toolchain Strategy & Implementation

  • Architect telemetry pipelines using OpenTelemetry Collector and Splunk Observability Cloud (SignalFx, APM, RUM, Log Observer).
  • Define tool adoption strategy and integration roadmap for OSS tools (Prometheus, Loki, Grafana, Jaeger) and Splunk-based stacks.
  • Guide teams on instrumentation approaches (auto/manual) across languages like Java, Go, Python, .NET, etc.
  1. Reliability Engineering Enablement

  • Lead adoption of SRE principles including incident management frameworks, resiliency testing, and runbook automation.
  • Collaborate with DevOps to integrate observability into CI/CD pipelines (e.g., Jenkins, ArgoCD, GitHub Actions).
  • Define health checks, golden signals, and SPoG (Single Pane of Glass) dashboards.
  • Exposure to

    AIOps

    , ML-based anomaly detection, or business observability.
  1. Stakeholder Management & Governance

  • Serve as a technical liaison between client leadership, SREs, developers, and infrastructure teams.
  • Run workshops, assessments, and evangelize observability-first culture across teams.
  • Provide guidance on data retention, access control, cost optimization, and compliance (especially with Splunk ingestion policies).
  1. Performance & Optimization

  • Continuously monitor and fine-tune observability data flows to prevent alert fatigue and ensure actionability.
  • Implement root cause analysis practices using telemetry correlation across metrics, logs, and traces.
  • Lead efforts to build self-healing systems using automated playbooks and AIOps integrations (where applicable).

Required Skills & Qualifications

  • 15+ years in IT, with

    5 years in Observability/SRE architecture roles

  • Proven experience designing architecture for

    microservices, containers (Docker, Kubernetes), and distributed systems

  • Strong hands-on expertise with:
    • Splunk Observability Cloud

      (SignalFx, Log Observer, APM)
    • OpenTelemetry (SDKs + Collector)

    • Prometheus + Grafana

    • Jaeger / Zipkin

      for distributed tracing
    • CI/CD tools

      : Jenkins, GitHub Actions, ArgoCD
  • Ability to build and present

    clear architecture diagrams and solution roadmaps

  • Working knowledge of

    cloud environments

    (AWS, Azure, GCP) and container orchestration (K8s/OpenShift)
  • Familiarity with

    SRE and DevOps best practices

    (error budgets, release engineering, chaos testing)

Nice to Have

  • Splunk certifications: Core Consultant, Observability Specialist, Admin
  • Knowledge of ITIL and modern incident management frameworks (PagerDuty, OpsGenie)
  • Experience in

    banking or regulated enterprise environments

Soft Skills

  • Strong leadership and

    cross-functional collaboration

  • Ability to work in ambiguous, fast-paced environments
  • Excellent

    documentation and communication skills

  • Passion for mentoring teams and

    building best practices at scale

Why This Role Matters

Observability and SRE ecosystem

  • Unifying legacy and modern telemetry stacks
  • Driving reliability-first mindset and tooling
  • Establishing a scalable blueprint for production excellence

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
NTT DATA, Inc. logo
NTT DATA, Inc.

IT Services and IT Consulting

Tokyo Plano

RecommendedJobs for You