Platform & Site Reliability Engineer - SSE

7 - 12 years

15 - 20 Lacs

Posted: 1 week ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

  • We are looking for a highly skilled senior engineer to join our Platform & Reliability Engineering team
  • In this role, you will be responsible for designing, building, and maintaining scalable and reliable platform solutions that empower software and AIOps delivery
  • You will accelerate AIOps delivery and SRE operations, targeting high-to-elite performance as measured by DORA metrics

Key Responsibilities

  • Platform Development: Design, implement, and optimize core platform services, APIs, and automation frameworks to support software development, AIOps, and SRE operations
  • Infrastructure as Code (IaC): Develop and maintain infrastructure using tools such as Terraform
  • Cloud Engineering: Architect and optimize cloud-native solutions in GCP and on-prem OpenShift, ensuring reliability, scalability, and cost-efficiency
  • Automation & CI/CD: Implement and enhance CI/CD pipelines using tools such as GitHub Actions, GitLab CI, Jenkins, or ArgoCD to improve software delivery
  • Ensure standardized DORA observability across prioritized development programs using Gathr as the platform
  • Observability & Performance: Establish monitoring, logging, and alerting strategies using Prometheus, Grafana, OpenTelemetry, NewRelic, or similar technologies
  • Enable 100% SLO observability for onboarded services in SRE
  • Security & Compliance: Embed security best practices in platform solutions, including identity management, secrets management, and policy enforcement
  • AIOps & SRE Enablement: Support AIOps in production 24/7 through SRE practices, and enhance automation capabilities for proactive incident resolution
  • Decommissioning & Optimization: Contribute to decommissioning NSO-ONAP tenant software and optimizing platform services
  • Technical Leadership: Provide mentorship and guidance to junior developers and advocate for engineering excellence and DevOps culture

Required Skills & Experience

  • 7+ years of professional software development experience, with at least 3 years focused on platform engineering, DevOps, or SRE
  • Proficiency in at least one programming language such as Python, Go, Java, or Rust
  • Hands-on experience with cloud platforms (GCP and on-prem OpenShift) and cloud-native technologies such as Kubernetes
  • Strong knowledge of Infrastructure as Code (Terraform)
  • Experience with containerization technologies (Docker, Kubernetes, Helm)
  • Expertise in CI/CD tooling and best practices for software deployment and release automation
  • Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, OpenTelemetry, NewRelic, ELK stack)
  • Strong problem-solving skills and a track record of delivering scalable and maintainable solutions
  • Excellent communication and collaboration skills, with experience working in agile environments

Nice to Have:

  • Experience with service meshes and API gateways
  • Knowledge of SRE principles and reliability engineering
  • Experience with FinOps and cost optimization in cloud environments
  • Exposure to policy-as-code frameworks

Skills:

  • DevOps
  • Google Cloud Platform
  • Kubernetes
  • Python
  • Terraform

CGI

Information Technology and Consulting

Montreal