Site Reliability Engineer

3 - 6 years

0 Lacs

Posted:1 day ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

This role is for one of Weekday's clientsLocation: ChennaiJobType: full-time

Requirements

What will you do?

  • We're looking for a self-motivated, enthusiastic, and hands-on engineer to set up solid DevOps and SRE foundations. If you thrive in a small, high-energy team and want to play a key role in shaping infrastructure and reliability at scale, this is the place for you
  • We're looking for a hands-on engineer with 3-6 years of experience who has a solid grasp of cloud infrastructure, a strong foundation in Infrastructure as Code (IaC), and a keen eye for choosing the right tools for the job. You'll help design, build, and scale resilient infrastructure for a fast-growing, product-driven team
  • Design, build, and manage cloud infrastructure using Infrastructure as Code (IaC) tools like Terraform, Ansible, Chef, or CloudFormation
  • Champion observability by defining SLIs, SLOs, and building robust monitoring, logging, and alerting systems using tools like Prometheus, Grafana, and custom telemetry
  • Ensure availability, scalability, and resilience of our SaaS platform and platform services in production
  • Proven ability to improve system observability through the design and instrumentation of system-level metrics, enhancing visibility into system health, performance, and bottlenecks
  • Dive deep into complex system architectures to solve critical performance and reliability challenges
  • Work with developers and product teams to embed NFR (Non-functional Requirements) into every product and feature release
  • Conduct root cause analysis and system-level debugging (primarily on Linux)
  • Build and maintain CI/CD pipelines, automating deployments and infrastructure operations across environments
  • Scale infrastructure to meet growth needs while optimizing cost and performance
  • Take ownership of incident response, on-call rotations, and blameless postmortems
  • Collaborate cross-functionally to drive technical and architectural decision
  • Highly self-driven, accountable, and eager to own initiatives end-to-end. Comfortable working in startups or small teams, where flexibility, speed, and autonomy are key. Strong communication and cross-team collaboration skills

You should apply if

  • Proficient in at least one programming language — Python, Java, or similar
  • Demonstrated experience with performance optimization, latency reduction, and scaling services
  • Strong analytical skills for incident debugging, log analysis, and system troubleshooting
  • Understanding of service-level metrics (SLIs, SLOs, error budgets) and how to operationalize them
  • Experience building large-scale, distributed, resilient systems
  • Strong understanding of core infrastructure components such as load balancers, firewalls, and databases — including their internal workings and operational fundamentals
  • Solid understanding of infrastructure cost management — proactively identifies cost drivers, implements optimization strategies, and contributes to cost reduction initiatives without compromising reliability or performance
  • Familiarity with on-call responsibilities, incident management, and root cause analysis
  • Strong experience with Infrastructure as Code (IaC): Terraform, Ansible, Chef, or CloudFormation and other orchestration tools
  • Ability to deep-dive into third-party or internal library codebases to understand internal behavior, debug complex issues, and contribute insights or fixes when needed
  • Solid understanding of cloud platforms — preferably AWS, but Azure or GCP is also acceptable

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

hyderabad, telangana, india