Site Reliability Engineer

7 - 9 years

0 Lacs

Posted:19 hours ago| Platform: Foundit logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE)

Key Responsibilities

  • Cloud Infrastructure (AWS):

  • Design, implement, and manage scalable, resilient, and cost-optimized cloud infrastructure using AWS services (EC2, EKS, Lambda, RDS, S3, CloudFront, IAM, VPC, etc.).
  • Implement Infrastructure as Code (IaC) using tools like

    Terraform / CloudFormation

    .
  • DevOps & Automation:

  • Build and maintain

    CI/CD pipelines

    (Jenkins, GitHub Actions, GitLab CI, or AWS CodePipeline) for automated deployments.
  • Automate repetitive tasks to improve development velocity and operational efficiency.
  • Observability & Monitoring:

  • Define and implement

    observability strategy

    covering monitoring, logging, tracing, and alerting.
  • Work with tools like

    Prometheus, Grafana, ELK/EFK stack, AWS CloudWatch, Datadog, New Relic, Splunk, or Dynatrace

    .
  • Establish

    SLIs, SLOs, and SLAs

    to measure and improve system reliability.
  • Site Reliability Engineering (SRE):

  • Drive incident management processes detection, alerting, root cause analysis, and postmortems.
  • Apply

    chaos engineering

    principles to validate resilience and recovery.
  • Optimize reliability, latency, scalability, and system efficiency.
  • Security & Compliance:

  • Implement best practices for cloud security, identity & access management, and compliance frameworks (ISO, SOC2, GDPR, etc.).
  • Ensure observability and monitoring meet security and audit requirements.
  • Collaboration & Leadership:

  • Partner with development, QA, and product teams to ensure seamless deployments.
  • Mentor junior engineers and promote a culture of

    reliability, automation, and continuous improvement

    .

Required Skills & Qualifications

  • 7+ years

    of professional experience in DevOps, Cloud Infrastructure, or SRE roles.
  • Strong expertise in AWS Cloud

    (certification preferred: AWS Certified DevOps Engineer, Solutions Architect, or SysOps).
  • Proficiency in

    IaC tools

    (Terraform, CloudFormation).
  • Solid experience in

    CI/CD pipeline tools

    (Jenkins, GitHub Actions, GitLab CI/CD, AWS CodePipeline).
  • Hands-on with

    observability tools

    : Prometheus, Grafana, CloudWatch, ELK, Datadog, New Relic, Splunk, or similar.
  • Deep understanding of

    SRE principles

    : SLIs/SLOs, error budgets, incident response, chaos testing.
  • Strong scripting/coding experience (Python, Bash, Go, or similar).
  • Knowledge of

    containers & orchestration

    (Docker, Kubernetes, EKS).
  • Familiarity with

    security best practices

    in cloud-native environments.

Preferred Skills

  • Experience with

    multi-cloud or hybrid-cloud environments

    .
  • Exposure to

    resiliency testing & chaos engineering tools

    (Gremlin, Litmus, Chaos Mesh).
  • Knowledge of cost-optimization and FinOps in AWS.
  • Excellent communication and stakeholder management skills.

What We Offer

  • Opportunity to work on cutting-edge cloud-native architectures.
  • A culture focused on

    automation, reliability, and innovation

    .
  • Growth opportunities with certifications, training, and leadership exposure.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You

bengaluru, karnataka, india