Senior Engineering Manager, Self-Hosted Kafka Infrastructure

5 - 10 years

9 - 14 Lacs

Posted: 2 hours ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

Qualifications

Must-Haves
  • Management Experience:

    Over 5 years managing engineering teams, with a record of hiring and developing senior talent.
  • Kafka Mastery:

    Deep knowledge of Apache Kafka internals and experience operating Kafka at scale.
  • Kubernetes & Cloud:

    Experience managing stateful workloads on Kubernetes in public clouds (AWS, GCP, or Azure).
  • Automation Mindset:

    History of prioritizing "software over operations"; experience building operators/control planes (Go, Java, or Python).
Nice-to-Haves
  • Experience with Apache Kafka ecosystem tools (Strimzi, Cruise Control).
  • Experience with Service Mesh (Istio or Linkerd) for traffic management.
Success Metrics (First 12 Months)
  • Successful transition:

    Complete migration of the first critical services to the self-hosted platform.
  • Control plane MVP:

    Deliver the initial centralized control plane managing cluster lifecycle.
  • Operational improvement:

    Cut manual operator intervention by 50% through automation.
  • Cost optimization:

    Reduce infrastructure cost per MB/s of throughput relative to the previous managed solution.
Key Responsibilities

1. Technical Strategy & Control Plane Engineering
  • Control Plane Development:

    Design and implement the Kafka Control Plane to automate provisioning, scaling, upgrading, and self-healing.
  • Migration Execution:

    Lead the transition of critical workloads from cloud-managed solutions to the self-hosted platform.
  • Kafka Internals:

    Guide the team on Kafka internals (KRaft migration, partition rebalancing, tiered storage).
  • Cloud Infrastructure:

    Manage stateful streaming workloads on Kubernetes (EKS/GKE/AKS) across multi-cloud regions.
2. People Management & Culture
  • Empathetic Leadership:

    Lead, mentor, and develop a team of senior distributed systems engineers, fostering high performance and psychological safety.
  • Operational Excellence:

    Set rigorous standards to ensure the team prioritizes engineering fixes for operational toil and maintains healthy on-call rotations.
3. Cross-Organization Collaboration
  • Architecture Partnership:

    Collaborate with Product Engineering, Data Platform, and AI/ML teams to optimize streaming architecture, reducing latency and cost.
  • Client Advocacy:

    Support the migration of client services by tuning their producers and consumers for optimal performance on the new platform.

Atlassian

Software Development

Sydney NSW
