Site Reliability Engineer

0 years

0 Lacs

Posted: 1 day ago | Platform: Foundit

Work Mode: On-site

Job Type: Full Time

Job Description

Position Overview:

The Site Reliability Engineer is responsible for the stability, scalability, performance, and reliability of systems and infrastructure.

Key Responsibilities:

  • Design, build, and maintain highly reliable and scalable systems and infrastructure.
  • Automate deployment, monitoring, and maintenance processes using DevOps tools and scripts.
  • Implement and manage CI/CD pipelines to support continuous delivery.
  • Monitor application performance, identify bottlenecks, and improve uptime and reliability.
  • Develop and maintain incident response procedures, including root cause analysis and postmortems.
  • Collaborate with development teams to design systems for fault tolerance, load balancing, and failover.
  • Manage and optimize cloud infrastructure (AWS, Azure, GCP).
  • Implement observability solutions (logging, metrics, tracing, and alerting).
  • Maintain strong security and compliance standards across infrastructure.
  • Participate in on-call rotations and ensure 24/7 system availability.
  • Document processes, configurations, and runbooks for operational consistency.

Required Skills & Qualifications:

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Strong knowledge of Linux/Unix systems administration and shell scripting.
  • Proficiency with automation and configuration tools (Ansible, Terraform, Chef, Puppet).
  • Experience with cloud platforms (AWS, Azure, or Google Cloud).
  • Familiarity with containerization and orchestration tools (Docker, Kubernetes).
  • Solid understanding of CI/CD tools (Jenkins, GitLab CI, CircleCI).
  • Strong experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack, Datadog).
  • Knowledge of networking fundamentals, load balancing, and DNS management.
  • Proficiency in at least one programming language (Python, Go, or Bash).
  • Excellent analytical, problem-solving, and communication skills.

Preferred Qualifications:

  • Experience with infrastructure-as-code (IaC) and serverless architectures.
  • Knowledge of reliability metrics such as SLOs, SLIs, and error budgets.
  • Exposure to database administration (MySQL, PostgreSQL, MongoDB, Redis).
  • Familiarity with security practices for cloud-native systems.
  • Certifications such as AWS Certified DevOps Engineer, Google SRE Certification, or CKA (Certified Kubernetes Administrator).
