Site Reliability Engineer

Tally Solutions

1 - 3 years

9 - 13 Lacs

Bengaluru

Posted: 7 hours ago

Skills Required

Site reliability, Redis, sales, Docker, Ansible, incident response, Git, PostgreSQL, IAM, EC2, DevOps, Linux, shell scripting, Prometheus, architecture, S3, Python, SRE, Django, Grafana, Bash, Terraform, AWS, Tally, infrastructure as code, finance

Work Mode

Hybrid

Job Type

Full Time

Job Description

What You Will Own

As a Site Reliability Engineer, you will help strengthen our infrastructure, automation, and monitoring stack. You will work closely with the Senior Platform Engineer and the backend team to ensure our services run reliably, efficiently, and securely.

Initially, your focus will be on building and maintaining strong monitoring and observability across our systems. Over time, you'll expand into broader platform engineering responsibilities such as infrastructure automation, CI/CD optimization, cost management, and reliability engineering.

This is a great opportunity for someone who wants to learn modern platform practices hands-on and grow into a well-rounded DevOps or reliability engineer.

Experience You Should Bring

2-3 years of experience in platform, DevOps, or SRE-type roles.
Basic understanding of AWS (EC2, S3, CloudWatch, IAM).
Familiarity with Linux systems and shell scripting.
Experience with Docker and Git-based CI/CD workflows.
Exposure to monitoring or observability tools (Grafana, Prometheus, InfluxDB, Sentry).
Understanding of Python or Django-based applications is a plus.
Experience with Celery, Redis, and PostgreSQL.
Exposure to Terraform, Ansible, or similar tools.
Basic knowledge of networking and security fundamentals.
Scripting in Python or Bash for automation tasks.

What You Will Be Doing

Monitoring & Observability

Implement and maintain monitoring for AWS resources, services, and network health.
Integrate and manage application-level monitoring via Sentry and similar tools.
Ensure visibility into Celery workers, background jobs, and API performance.
Maintain dashboards and alerts for infrastructure, application, and business metrics.

Infrastructure & Automation

Assist in managing and automating AWS environments (EC2, S3, IAM, CloudWatch).
Support configuration and deployment workflows using Docker and CI/CD pipelines.
Help define and improve infrastructure-as-code practices (Terraform, Ansible, etc., as applicable).

Reliability & Incident Response

Contribute to the incident management process and postmortem reviews.
Develop basic health checks, automated recovery scripts, and monitoring rules.
Support scaling, resilience, and performance improvements across environments.

Data & Business Systems Health

Track health of internal data syncs.
Build basic checks and reports to ensure data integrity and freshness.
Support product and analytics teams in collecting business metrics (InfluxDB, Grafana, etc.).

Growth & Learning

Work closely with senior engineers to understand production architecture and best practices.
Continuously learn new tools and approaches in observability, automation, and cloud operations.
Contribute to improving internal documentation and platform standards.
