Job
Description
About The Role :
Job Title - Senior Production Support Lead, VP
Location - Pune, India
Role Description
We are seeking a
Senior Production Support Leadto take full ownership of L1 and L2 support operations across three strategic enterprise platforms:
API Management Platform, Kafka-as-a-Service (KaaS) and Data Platform.This is a
senior leadership roletaking ownership of existing global support support team, ensuring
24X7 operational excellence, efficient incident resolution, and strong collaboration with engineering and business stakeholders. The role emphasises
leadership, process optimisation, and operational reliability,rather than team creation.The successful candidate will have the depth to lead day-to-day production support and foresight to shape the future support model, embedding best-in-class practices such as automation, shift-left, and service reliability engineering.
Your key responsibilities
Leadership & Operational Oversight
Take ownership of
existing L1 and L2 support teams, ensuring consistent coverage and adherence to SLAsServer as the
senior escalation pointfor high-priority incident and ensure timely resolution and stakeholder communicationCollaborate with engineering teams to ensure
operational readiness for new features and releasesDrive
continuous improvement in support processes, knowledge base usage, and incident response efficiencyDefine and track Service Level Objectives (SLOs) across supported platforms, ensuring alignment with business expectations and operational goals
Incident & Problem Management
Oversee
24X7 monitoring, alert triage, and incident resolutionfor all platformsEnsure
L1 team handle alerts, health checks and service requestsefficiently, while
L2 team manages deeper investigations and diagnosticsLead p
ost-incident reviews, ensuring actionable improvements and lessons learned are appliedProactively identify
recurring issues and reliability gaps, coordinating with L2/Platform engineering for resolution.
Team & Stakeholder Engagement
Mentor and guide support analysts to improve technical skills, troubleshooting capabilities, and operational disciplineRepresent production support in cross-functional forms, ensuring alignment with business an engineering prioritiesFoster a culture of accountability, operational excellence, and proactive problem-solving
Your skills and experience
Required Skills & Experience
12+ years of IT experience, with 5+ years in
senior support or operations leadershipExperience managing
existing global L1/L2 support teams, ideally in multi-platform environmentsStrong expertise in
incident, problem, and change management frameworksProven ability to lead during
major incidents/outages, with clear executive communicationStrong cross-cultural leadership experience across distributed teams
Technical Skills
Cloud & ContainersGoogle Cloud Platform (GCP), Kubernetes/Anthos
API ManagementApigee, REST, OAuth2/OpenID Connect
Messaging & StreamingKafka, Zookeeper, Schema Registry
Data Platforms Tools:Spark, Big Query, Airflow, SQL
ObservabilityLogging (Splunk, Cloud Logging), Monitoring (NewRelic, GCO, Prometheus, Grafana), Alerting
ITSM & AutomationServiceNow/JIRA, Ansible/Terraform, GitOps, CI/CD
Scripting & TroubleshootingBash, Python, SQL for operational tasks
Resilience & RecoveryHigh-availability setups, DR Planning, cluster scaling and tuning
Preferred Qualifications:
ITIL Expert, SRE Practitioner, or equivalent certificationsExperience
optimising and scaling existing support teamsKnowledge of
shift-left strategies, automation, and reliability engineering practices