Sr Site Reliability Engineer

Company: F5
Experience: 7 - 12 years
Salary: 10 - 14 Lacs
Location: Hyderabad
Posted: 4 days ago

Skills Required

Software Engineer, Kubernetes, Python, Data Management, NGINX, Issue Resolution, SRE, Cybersecurity, Docker, Ansible, Alerting, Cloud, Configuration Management, Git, Linux, Bash, Terraform, Container Orchestration, AWS, Infrastructure as Code, Azure

Work Mode

Work from Office

Job Type

Full Time

Job Description

The Senior Site Reliability Engineer will be responsible for ensuring the reliability, availability, and scalability of critical NGINX systems and SaaS platforms. Systems under the care of a Senior Site Reliability Engineer must operate effectively and reliably through scalable builds and deployments, frequent releases, and complex architectures that encompass modern technologies. You will work closely with technical and non-technical teams throughout the organization to facilitate the design and implementation of scalable solutions, drive automation initiatives, and monitor and maintain the performance of critical NGINX systems.

We are looking for someone who has:

Experience solving problems related to large-scale distributed systems, and the ability to take complex problems and identify potential solutions, knowns, and unknowns.
A drive for continuous improvement and efficiency.
The ability to write code in multiple languages, choosing the right strongly or dynamically typed language for the job.

Responsibilities:

Lead a project team, providing direction, issue resolution, and mentorship, as well as regular progress updates and reporting.
Solve problems in mission-critical services; implement solutions to prevent recurrence; lead retrospectives to explore and understand root causes, define next steps to avoid future incidents, and document and report findings.
Help shape SRE strategies by evaluating and contributing to product/service design. Participate in system design meetings, capacity planning, launch reviews, etc. to ensure support services/platforms are as efficient as possible before going live.
Scale systems sustainably through mechanisms such as automation and evolve systems by fostering changes that improve reliability and velocity.
Enhance data-driven engineering culture by providing statistical trends and analysis using real service data to increase service health and quality.

Knowledge, Skills, and Experience

Bachelor's degree (or higher) in Computer Science, Computer Engineering, or a related field.
7+ years of professional experience in software engineering.
Experience setting up and using incident and on-call management systems.
Experience setting up and building tools to collect and visualize data (logs, metrics, alerts), building dashboards, alerting, and monitoring systems.
Experience with deploying secure infrastructure and services in one or more cloud environments such as AWS or Azure.
Experience with configuration management and deployment automation tools, such as Terraform, Ansible, Packer, etc.
Proficiency in scripting languages such as Python and Bash.
Experience with container (Docker) and orchestration systems (Kubernetes).
Solid understanding of the Linux OS and systems administration skills.
Excellent analytical and troubleshooting skills.
Dynamic collaborator who thrives in diverse, geographically distributed locales.
Team player who demonstrates diplomacy and promotes sound ideas and concepts, paired with the desire to help others grow their skills.
Strong verbal and written communication skills.
Experience with NGINX technologies is a strong plus.

Fundamental competencies:

SYSTEM EXPERIENCE

Application Build and Deployment Processes (git*, automation pipelines, Infrastructure as code, etc.)
Automated Application Delivery (load balancers, container orchestration, service mesh, high availability architectures, frontend and backend technologies including databases, etc.)
Service Operation (Define, instrument, measure, and manage service level objectives. Experience with observability tooling including logging infrastructure, time series metrics databases, tracing systems, alert definitions, etc.)
Incident Management (service restoration, root cause analysis, postmortem authorship, definition of roles and responsibilities, etc.)
Security awareness and competencies, including security as code.
Configuration management

OBSERVABILITY

Explores beyond the obvious to ensure Service Level Objectives (SLO) are met.
Understands and measures system behaviors to quickly and efficiently diagnose, identify, and address needs.
Proactively tests, automates, monitors outputs, and leverages signals to infer services and needs.
Uses data management to explore properties, patterns, and distributed tracing.

SOLUTIONIST

Constantly seeking ways to improve systems, making them more efficient and reducing toil.
Understands the difference between short-term tactical fixes and long-term strategic ones.
Simplifies decisions and judgments by recognizing what to pay attention to and what to ignore; a proficient problem solver. Tenacious and resourceful with an inherent predisposition toward action; unafraid to try something new in the name of innovation.

FORWARD THINKING

Possesses an inherent bias toward innovation, always abreast of developing ideas and technologies. Thoughtfully and strategically considers future needs and opportunities, and advocates positive change.
Technological creativity and capacity

COMMUNICATION AND COLLABORATION

Conveys information, vision, and strategy in an accurate and timely manner, adjusting to ensure understanding based on the audience. Actively listens; seeks to understand rather than respond. Proactively solicits and values diverse perspectives, ideas, and opinions.
