Home
Jobs

1 Gkeiam Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

7.0 - 12.0 years

27 - 35 Lacs

Bengaluru

Work from Office

Job Overview We are hiring a seasoned Site Reliability Engineer with strong experience in building and operating scalable systems on Google Cloud Platform (GCP). You will be responsible for ensuring system availability, performance, and security in a complex microservices ecosystem, while collaborating cross-functionally to improve infrastructure reliability and developer velocity. Key Responsibilities - Design and maintain highly available, fault-tolerant systems on GCP using SRE best practices. - Implement SLIs/SLOs, monitor error budgets, and lead post-incident reviews with RCA documentation. - Automate infrastructure provisioning (Terraform/Deployment Manager) and CI/CD workflows. - Operate and optimize Kubernetes (GKE) clusters including autoscaling, resource tuning, and HPA policies. - Integrate observability across microservices using Prometheus, Grafana, Stackdriver, and OpenTelemetry. - Manage and fine-tune databases (MySQL/Postgres/BigQuery/Firestore) for performance and cost. - Improve API reliability and performance through Apigee (proxy tuning, quota/policy handling, caching). - Drive container best practices including image optimization, vulnerability scanning, and registry hygiene. - Participate in on-call rotations, capacity planning, and infrastructure cost reviews. Must-Have Skills - Minimum 8 years of total experience, with at least 3 years in SRE, DevOps, or Platform Engineering roles. - Strong expertise in GCP services (GKE, IAM, Cloud Run, Cloud Functions, Pub/Sub, VPC, Monitoring). - Advanced Kubernetes knowledge: pod orchestration, secrets management, liveness/readiness probes. - Experience in writing automation tools/scripts in Python, Bash, or Go. - Solid understanding of incident response frameworks and runbook development. - CI/CD expertise with GitHub Actions, Cloud Build, or similar tools. Good to Have - Apigee hands-on experience: API proxy lifecycle, policies, debugging, and analytics. - Database optimization: index tuning, slow query analysis, horizontal/vertical sharding. - Distributed monitoring and tracing: familiarity with Jaeger, Zipkin, or GCP Trace. - Service Mesh (Istio/Linkerd) and secure workload identity configurations. - Exposure to BCP/DR planning, infrastructure threat modeling, and compliance (ISO/SOC2). Educational & Certification Requirements - B.Tech / M.Tech / MCA in Computer Science or equivalent. - GCP Professional Cl

Posted 3 days ago

Apply
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies