Site Reliability Engineer

3 years

0 Lacs

Posted:11 hours ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Batch is a brand-first technology platform designed to amplify customer engagement, enable frictionless transactions, defend product authenticity, elevate customer loyalty, and ignite customer growth. Our mission is to provide seamless solutions that help businesses build stronger connections with their customers. With a focus on enhancing the customer experience, Batch delivers innovative technology that drives value and fosters long-term success.


Role Description

Site Reliability Engineer (SRE)


Key Responsibilities:

  • Design, deploy, and maintain

    cloud infrastructure

    (AWS) ensuring reliability, scalability, and security.
  • Manage and operate

    Kubernetes clusters

    , ensuring high availability and performance.
  • Implement

    infrastructure as code

    using

    Terraform/Terragrunt

    to automate provisioning and updates.
  • Develop and maintain

    CI/CD pipelines

    using

    GitLab

    for application deployment.
  • Write automation scripts in

    Python

    and

    Bash

    to streamline operational tasks and monitoring.
  • Manage

    Docker-based deployments

    , optimizing container performance and security.
  • Monitor and optimize

    PostgreSQL databases

    (including RDS) for performance, backup, and recovery.
  • Troubleshoot production issues and provide root cause analysis to prevent recurrence.
  • Collaborate with development and operations teams to improve system reliability, deployment processes, and infrastructure efficiency.
  • Implement

    observability solutions

    : logging, metrics, and alerts to ensure system health.


Required Skills & Experience:

  • 3+ years

    of professional experience in Site Reliability, DevOps, or Cloud Engineering roles.
  • Strong expertise in

    Kubernetes

    , including deployments, scaling, and cluster management.
  • Hands-on experience with

    AWS services

    (EC2, RDS, S3, IAM, CloudWatch, etc.).
  • Experience with

    Terraform/Terragrunt

    for infrastructure automation.
  • Proficient in

    GitLab CI/CD pipelines

    and version control workflows.
  • Strong scripting skills in

    Python

    and

    Bash

    for automation and operational tasks.
  • Experience with

    Docker

    containerization and orchestration.
  • Working knowledge of

    PostgreSQL

    and RDS management.
  • Strong problem-solving skills and attention to detail.
  • Good communication skills and ability to collaborate across teams.


Preferred Skills:

  • Experience with

    monitoring and observability tools

    (Prometheus, Grafana, ELK Stack, etc.).
  • Familiarity with

    microservices architectures

    and cloud-native applications.
  • Understanding of

    security best practices

    in cloud infrastructure.
  • Experience with

    event-driven architectures

    and messaging systems (Kafka, RabbitMQ, etc.).

Benefits:

  • Competitive salary and benefits package.
  • Opportunities for career growth and professional development.
  • Work with cutting-edge cloud and DevOps technologies.
  • Collaborative and inclusive work environment.
  • Flexible work arrangements (if applicable).

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You