GitHub action +Devops; DevOps and SRE; APM; GCP Monitoring

7 - 12 years

1 - 2 Lacs

Posted:3 weeks ago| Platform: Foundit logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Lead the design, implementation, and continuous improvement of DevOps and IaC pipelines using Terraform on GCP, integrated with Azure DevOps.

Own end-to-end delivery of infrastructure and application releases, change propagation, and production deployments.

Oversee monitoring, alerting, and observability across:

Network: VPC flow logs, firewall ingress/egress, latency, and DNS resolution

Infrastructure: VM/compute instance health, memory/CPU thresholds, autoscaling, IAM, and storage

Applications: Performance, error rates, response times, and transaction health using AppDynamics

Drive the setup and maintenance of observability dashboards and alerts, using tools such as:

GCP Operations Suite (Stackdriver) Logs, Monitoring, Uptime checks

Prometheus & Grafana Infrastructure and custom application metrics

AppDynamics End-to-end application performance monitoring (APM), user journey insights, and deep-dive diagnostics

Azure Monitor & Log Analytics Visibility into hybrid cloud resources

Implement and manage a 16x5 incident response model, including:

On-call rotation and coverage planning

Triage, escalation, and war room management

Incident communication and stakeholder updates

RCA documentation and preventive action tracking

Ensure documentation of runbooks, SOPs, and proactive health checks.

Track and report SLO/SLA compliance, error budgets, and availability KPIs.

Coach and lead the DevOps/SRE team on modern delivery practices and reliability engineering.

Collaborate with cloud architects, product teams, and security leads to ensure resilient and compliant delivery practices.

Required Skills & Experience

7+ years in IT, with 3+ years leading DevOps/SRE or cloud infrastructure delivery teams.

Strong experience with Terraform for automating GCP infrastructure provisioning.

Hands-on with Azure DevOps for CI/CD pipeline development and release automation.

Deep understanding of GCP services (GKE, GCE, IAM, VPC, CloudSQL, etc.).

Experience with AppDynamics for application performance monitoring (APM) and business transaction tracing.

Familiarity with infrastructure and network monitoring tools such as GCP Monitoring, Prometheus, and Grafana.

Proven ability to manage incident response processes in a 16x5 support environment.

Excellent communication, team leadership, and cross-functional stakeholder management skills.

Nice to Have:

Certifications: GCP Professional Cloud DevOps Engineer, Terraform Associate, or Azure DevOps Engineer Expert.

Experience in regulated domains like healthcare

Knowledge of security and compliance integrations in CI/CD (e.g., Veracode, Checkov).

Exposure to Kubernetes, service mesh, or GitOps-based workflows.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You