Site Reliability Engineer/Architect - CI/CD Pipeline

10 years

0 Lacs

Posted:4 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Job Summary

We are seeking an experienced Site Reliability Engineer (SRE) Architect with over 10 years of IT experience, specializing in designing and implementing highly scalable, reliable, and automated systems.The ideal candidate will have strong expertise in cloud-native architectures, automation, monitoring, and SRE practices.This role requires excellent leadership, technical depth, and the ability to guide large-scale enterprise reliability initiatives.

Key Responsibilities

  • Design and implement scalable, reliable, and automated infrastructure solutions.
  • Lead SRE initiatives across multiple teams, ensuring adherence to SRE principles (SLIs, SLOs, SLAs).
  • Drive incident management, root cause analysis, and postmortem processes.
  • Define and implement observability standards (monitoring, logging, alerting).
  • Collaborate with development and operations teams to improve system reliability and performance.
  • Automate infrastructure provisioning and deployments using IaC (Terraform, Ansible, etc.).
  • Build and optimize CI/CD pipelines for zero-downtime deployments.
  • Ensure high availability, fault tolerance, and disaster recovery strategies.
  • Establish performance benchmarks, load testing, and capacity planning.
  • Provide leadership and mentorship to SRE and DevOps teams.

Required Skills & Qualifications

  • 10+ years of IT experience with at least 5 years in SRE/DevOps roles.
  • Expertise in cloud platforms: AWS, Azure, or GCP.
  • Strong knowledge of Kubernetes, Docker, and microservices architecture.
  • Hands-on experience with Infrastructure as Code (Terraform, Ansible, CloudFormation).
  • Proficiency in programming/scripting languages such as Python, Go, or Bash.
  • Experience with monitoring tools (Prometheus, Grafana, ELK, Datadog, Dynatrace).
  • Strong background in CI/CD pipeline design and automation (Jenkins, GitHub Actions, GitLab CI).
  • In-depth knowledge of networking, load balancers, DNS, and security best practices.
  • Excellent problem-solving and incident management skills.
  • Strong leadership and stakeholder management abilities.

Preferred Qualifications

  • Certified Kubernetes Administrator (CKA) or AWS/Azure/GCP Cloud Architect certification.
  • Experience in large-scale distributed systems design.
  • Background in performance engineering and chaos engineering.
  • Knowledge of ITIL practices for incident, problem, and change management.
(ref:hirist.tech)

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You