7 - 12 years
1 - 2 Lacs
Posted:3 weeks ago|
Platform:
On-site
Full Time
Lead the design, implementation, and continuous improvement of DevOps and IaC pipelines using Terraform on GCP, integrated with Azure DevOps.
Own end-to-end delivery of infrastructure and application releases, change propagation, and production deployments.
Oversee monitoring, alerting, and observability across:
Network: VPC flow logs, firewall ingress/egress, latency, and DNS resolution
Infrastructure: VM/compute instance health, memory/CPU thresholds, autoscaling, IAM, and storage
Applications: Performance, error rates, response times, and transaction health using AppDynamics
Drive the setup and maintenance of observability dashboards and alerts, using tools such as:
GCP Operations Suite (Stackdriver) Logs, Monitoring, Uptime checks
Prometheus & Grafana Infrastructure and custom application metrics
AppDynamics End-to-end application performance monitoring (APM), user journey insights, and deep-dive diagnostics
Azure Monitor & Log Analytics Visibility into hybrid cloud resources
Implement and manage a 16x5 incident response model, including:
On-call rotation and coverage planning
Triage, escalation, and war room management
Incident communication and stakeholder updates
RCA documentation and preventive action tracking
Ensure documentation of runbooks, SOPs, and proactive health checks.
Track and report SLO/SLA compliance, error budgets, and availability KPIs.
Coach and lead the DevOps/SRE team on modern delivery practices and reliability engineering.
Collaborate with cloud architects, product teams, and security leads to ensure resilient and compliant delivery practices.
Required Skills & Experience
7+ years in IT, with 3+ years leading DevOps/SRE or cloud infrastructure delivery teams.
Strong experience with Terraform for automating GCP infrastructure provisioning.
Hands-on with Azure DevOps for CI/CD pipeline development and release automation.
Deep understanding of GCP services (GKE, GCE, IAM, VPC, CloudSQL, etc.).
Experience with AppDynamics for application performance monitoring (APM) and business transaction tracing.
Familiarity with infrastructure and network monitoring tools such as GCP Monitoring, Prometheus, and Grafana.
Proven ability to manage incident response processes in a 16x5 support environment.
Excellent communication, team leadership, and cross-functional stakeholder management skills.
Nice to Have:
Certifications: GCP Professional Cloud DevOps Engineer, Terraform Associate, or Azure DevOps Engineer Expert.
Experience in regulated domains like healthcare
Knowledge of security and compliance integrations in CI/CD (e.g., Veracode, Checkov).
Exposure to Kubernetes, service mesh, or GitOps-based workflows.
Tech Mahindra Limited
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.
hyderabad, telangana, india
1.5 - 2.5 Lacs P.A.
hyderabad, chennai, bengaluru
0.5 - 0.9 Lacs P.A.
1.2 - 1.2 Lacs P.A.
india
20.0 - 35.0 Lacs P.A.
srīnagar
Experience: Not specified
0.72 - 0.84 Lacs P.A.
3.6 - 4.8 Lacs P.A.
Experience: Not specified
0.96 - 1.8 Lacs P.A.
ahmedabad
1.62 - 2.22 Lacs P.A.
pune, maharashtra, india
4.0 - 12.0 Lacs P.A.
Experience: Not specified
9e-05 - 0.00014 Lacs P.A.