Site Reliability Engineer

4 - 6 years

4 - 7 Lacs

Posted:1 day ago| Platform: Foundit logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Site Reliability Engineer (SRE)

Key Responsibilities:

  • Design, implement, and manage scalable, resilient, and secure infrastructure systems.
  • Monitor, maintain, and improve system reliability, availability, scalability, and performance.
  • Build and enhance CI/CD pipelines using tools like

    Jenkins, GitLab CI, or Azure DevOps

    .
  • Develop infrastructure as code using

    Terraform, Ansible, or similar tools

    .
  • Automate operational processes and improve system observability through monitoring and alerting.
  • Troubleshoot and resolve production issues across services and technology stacks.
  • Collaborate with development, QA, and DevOps teams to define SLAs, SLOs, and SLIs.
  • Conduct post-incident reviews and develop action plans to prevent recurrence.
  • Participate in on-call rotations and ensure effective incident response.
  • Ensure security, compliance, and best practices are followed in infrastructure and deployments.

Required Skills:

  • 46 years of hands-on experience in

    Site Reliability Engineering, DevOps

    , or

    System Administration

    roles.
  • Strong proficiency in

    Linux/Unix administration

    .
  • Experience with

    cloud platforms

    such as

    AWS, Azure, or GCP

    .
  • Proficiency in one or more programming/scripting languages (

    Python, Go, Bash

    ).
  • Experience with

    monitoring and alerting tools

    (e.g.,

    Prometheus, Grafana, Datadog, ELK, Splunk

    ).
  • Knowledge of

    containerization

    and orchestration tools (e.g.,

    Docker, Kubernetes

    ).
  • Familiarity with version control systems (e.g.,

    Git

    ).

Preferred Skills:

  • Experience with

    incident management

    and

    root cause analysis

    .
  • Familiarity with

    zero downtime deployments

    and

    blue-green/canary deployments

    .
  • Experience in

    performance tuning

    ,

    load testing

    , and

    resilience engineering

    .
  • Certification in

    cloud platforms (AWS/Azure/GCP)

    is a plus.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Teamware Solutions logo
Teamware Solutions

IT Services and IT Consulting

Chennai Tamilnadu

RecommendedJobs for You

Mumbai Suburban, Thane, Mumbai (All Areas)

Hyderabad, Telangana, India

Bengaluru, Mumbai (All Areas)