Senior Site Reliability Engineer

8 years

0 Lacs

Posted:2 months ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Job Title: SRE Lead Engineer

Location: Hyderabad, India


We are seeking a DevOps / SRE Lead Engineer to architect and scale our client's multi-tenant SaaS platform with AI/ML at the core..


Site Reliability Engineering (SRE) Lead Engineer


About the Role


As the SRE Lead Engineer, you will be responsible for architecting, building, and maintaining infrastructure that powers a multi-tenant SaaS platform. You’ll drive reliability, scalability, and security, while supporting AI/ML pipelines in production. This is a hands-on role with significant ownership, requiring both technical depth and leadership in site reliability practices.


Key Responsibilities

  • Architect, design, and deploy end-to-end infrastructure for large-scale, microservices-based SaaS platforms.
  • Ensure system reliability, scalability, and security for AI/ML model integrations and data pipelines.
  • Automate environment provisioning and management using Terraform in AWS (EKS-focused).
  • Implement full-stack observability across applications, networks, and operating systems.
  • Lead incident management and participate in 24/7 on-call rotation.
  • Optimize SaaS reliability while enabling REST APIs, SSO integrations (Okta/Auth0), and cloud data services (RDS/MySQL, Elasticsearch).
  • Define and maintain backup and disaster recovery for critical workloads.


Required Skills & Experience

  • 8+ years in SRE/DevOps roles, managing enterprise SaaS applications in production.
  • Minimum 1 year experience with AI/ML infrastructure or model-serving environments.
  • Strong expertise in AWS cloud, particularly EKS, container orchestration, and Kubernetes.
  • Hands-on experience with Infrastructure as Code (Terraform), Docker, and scripting (Python, Bash).
  • Solid Linux OS and networking fundamentals.
  • Experience in monitoring and observability with ELK, CloudWatch, or similar tools.
  • Strong track record with microservices, REST APIs, SSO, and cloud databases.


Nice-to-Have Skills

  • Experience with MLOps and AI/ML pipeline observability.
  • Cost optimization and security hardening in multi-tenant SaaS.
  • Prior exposure to FinTech or enterprise finance solutions.


Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or related discipline.
  • AWS Certified Solutions Architect (strongly preferred).
  • Experience in early-stage or high-growth startups is an advantage.


Why Join?

  • Be at the forefront of AI/ML-powered SaaS innovation in FinTech.
  • Work with a high-energy, entrepreneurial team building next-gen infrastructure.
  • Take ownership of mission-critical reliability challenges.
  • Grow your career in an environment that values impact, adaptability, and innovation.


If you’re passionate about building secure, scalable, and intelligent platforms, we’d love to hear from you. Apply now to be part of our client’s journey in redefining enterprise finance operations.

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You