Site Reliability Engineer

Weekday AI (YC W21)

1 year

12 - 20 Lacs

Bengaluru, Karnataka, India

Posted: 7 hours ago


Skills Required

reliability engineering, DevOps, deployment, drive, analysis, collaboration, AWS, Azure, GCP, code, Terraform, storage, networking, orchestration, automation, automate, scaling, failover, monitoring, configuration management, tooling, efficiency, remediation, metrics, design, planning, optimization, scalability, Linux, troubleshooting, containerization, Docker, Kubernetes, Git, Datadog, scripting, Python

Work Mode

On-site

Job Type

Full Time

Job Description

This role is for one of Weekday's clients.

Salary range: Rs 12,00,000 - Rs 20,00,000 (i.e., INR 12-20 LPA)

Min Experience: 1 year

Location: Bengaluru

Job Type: Full-time

As an SRE, you will work closely with product engineering, DevOps, and platform teams to build resilient services, improve deployment processes, and drive operational excellence across the organization. You will be responsible for maintaining the health of our applications and infrastructure, strengthening reliability practices, and ensuring optimal system performance.

Requirements

Key Responsibilities

Reliability & Performance

Ensure high availability, resilience, and performance of production systems.
Conduct root cause analysis, implement long-term fixes, and reduce recurring incidents.
Develop and tune SLIs, SLOs, and error budgets in collaboration with engineering teams.

Infrastructure & Operations

Build, maintain, and optimize cloud infrastructure (AWS/Azure/GCP).
Implement Infrastructure-as-Code (IaC) using tools like Terraform, CloudFormation, or similar.
Manage compute, storage, networking, load balancers, and container orchestration systems.
Maintain CI/CD pipelines to streamline deployments and operational workflows.

Automation & Tooling

Automate operational tasks, scaling, failover, monitoring, and configuration management.
Develop tooling to improve engineering efficiency and reduce manual interventions.
Implement proactive alerting, self-healing mechanisms, and automated remediation workflows.

Observability & Incident Management

Build end-to-end observability using logs, metrics, traces, and dashboards.
Respond to production issues, participate in on-call rotations, and manage incident lifecycle.
Establish and improve incident response processes, runbooks, and reliability best practices.

Collaboration & Continuous Improvement

Partner with developers to design reliable architectures and production-ready solutions.
Promote SRE principles, reliability mindset, and performance culture across teams.
Contribute to capacity planning, cost optimization, and system scalability initiatives.

Required Skills & Qualifications

1-4 years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
Strong understanding of SRE fundamentals, including SLIs, SLOs, error budgets, and operational maturity models.
Hands-on experience with cloud platforms (AWS/Azure/GCP) and distributed systems.
Proficiency in Linux systems, OS internals, networking concepts, and performance troubleshooting.
Experience with Infrastructure-as-Code tools such as Terraform, CloudFormation, or Pulumi.
Familiarity with containerization (Docker) and orchestration (Kubernetes).
Good understanding of CI/CD pipelines, Git workflows, and release engineering.
Exposure to monitoring and observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
Scripting proficiency in Bash, Python, or Go (preferred).
Strong analytical, problem-solving, and incident-management skills.
