Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in bengaluru
>
Okta
>
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Okta

5 - 10 years

12 - 16 Lacs

bengaluru

Posted:-1 days ago| Platform:

Apply

Skills Required

kubernetes project leadership docker terraform aws container python production systems sre reliability microsoft azure site reliability engineering microservices nosql sql ansible devops jenkins troubleshooting software engineering

Work Mode

Hybrid

Job Type

Full Time

Job Description

As an

SRE Engineer

, you will champion all things pertaining to reliability at Okta on our Auth0 product. Working closely with the product engineers, quality engineers, platform engineers, and architecture teams, your primary focus will be on ensuring production systems remain operational at all times, while continually setting and achieving long-term performance, reliability, and scalability goals in a platform with a growth plan for the coming years.

You will play a key role in Auth0s dedication to ensuring customers uninterrupted access to business-critical enterprise and consumer applications. This is a hands-on role where you will directly operate, troubleshoot, and scale our production systems by responding to monitoring alerts and managing incidents as part of a team's 24/7 on-call rotation. Your work is critical to meeting the demands of ever-increasing traffic and user growth for our customers who rely on us to provide a reliable product experience.

You will:

Proactively identify and drive initiatives
within the teams charter to improve the reliability, scalability, and operational efficiency of our systems, empowering the team to implement solutions.
Participate in a global on-call rotation featuring a follow-the-sun model on weekdays and a lower-frequency, shared rotation for weekends to remediate incidents on critical systems.
Lead team-scoped projects
, taking responsibility for planning, execution, and delivery of key reliability improvements.
Use existing monitoring tools to identify problems and resolve and/or escalate to service teams
Implement changes to enable or improve infrastructure resilience, monitoring, and alerting
Develop and continuously refine SRE tools and processes to improve software delivery, observability, reliability, and operational efficiency.
Optimize existing systems and eliminate toil through simplification and automation.
Define, document, and advocate reliability best practices and policies
Mentor junior and peer SREs
through pair programming, design discussions, and code reviews to level up the team's technical capabilities.

You might be a good fit if you:

Have 5+ years of industry experience as a Site Reliability Engineer, supporting large-scale, mission-critical applications in a production cloud environment.
Believe in the SRE mindset: you are data-driven, embrace a blameless culture, and approach operational problems with a software engineering approach.
Have demonstrable experience participating in a 24/7 on-call rotation.
Possess deep expertise in a major cloud provider (Azure, AWS).
Have demonstrable experience managing infrastructure as code with Terraform at scale.
Have a strong understanding of cloud-native architecture, including containers (Docker, Kubernetes), microservices, modern networking concepts, and various database technologies (SQL, NoSQL, etc.).
Demonstrate proficiency in Go, with proven experience building and maintaining production-grade software, tools, and automation.
Have a systematic problem-solving approach, coupled with a strong sense of ownership and the drive to see complex issues through to resolution.

Possess exceptional proficiency in verbal and written English, allowing you to drive clarity during high-pressure incidents and articulate complex concepts.
Have strong interpersonal and collaboration skills, with a proven ability to build relationships and work effectively in a globally distributed, remote-first team.
Demonstrate a passion for mentoring other engineers, helping them develop their technical and operational skills.
Show a strong interest in taking on project leadership, with a desire to own initiatives from planning through to successful delivery.

More Jobs at Okta

Manager, Software Engineering

Bengaluru, Karnataka, India

Experience: Not specified

Salary: Not disclosed

Senior Analyst, Field Analytics (Bengaluru)

Bengaluru, Karnataka, India

Experience: Not specified

Salary: Not disclosed

People Systems Manager, Workday

Bengaluru, Karnataka, India

Experience: Not specified

Salary: Not disclosed

Software Engineer in Test, SDET 2

Bengaluru, Karnataka, India

Experience: Not specified

Salary: Not disclosed

Senior Full Stack Engineer (Auth0)

Bengaluru, Karnataka, India

Experience: Not specified

Salary: Not disclosed

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Okta

Login to

Please Verify Your Phone or Email

Confirm Action

Senior Site Reliability Engineer