Jobs

Interviews
Job Alerts
Tools

Upskill and Grow with AI

Mock Interview Practice interviews in realistic simulations

Coding Practice Improve your coding skills with challenges

Certification Earn certifications to validate your skills

AI Learning Get trained with AI expert sessions

Career Path AI insights for smarter career decisions

AI Job Match Score AI-Powered Job Match Against Your Resume and Optimize Your Resume

Career Tools and Resources

Resume Builder Build Professional Resume with Ease

ATS Friendliness Check Check Resume Friendliness for Applicant Tracking Systems

Auto Apply Apply to hundreds of jobs on any platform effortlessly

Co-Pilot (Chrome Extension) Your AI Assistant for Seamless Browsing Efficiency

Interview Questions Streamline interviews with ready-to-use questions

Salaries Discover market-driven salary insights across skillsets and geographies

Companies Explore leading companies actively hiring talent
For Employers

Home
>
Jobs in hyderabad
>
Oracle
>
Principal Site Reliability Developer

Principal Site Reliability Developer

Oracle

7 - 12 years

20 - 35 Lacs

hyderabad bengaluru

Posted:2 days ago| Platform:

Apply

Skills Required

linux administration infrastructure management python

Work Mode

Hybrid

Job Type

Full Time

Job Description

Role & responsibilities

At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for Enterprises as a diverse team of fellow creators and inventors. We act with the speed and attitude of a start-up, with the scale and customer-focus of the leading enterprise software company in the world. Compute is one of the core organisations within OCI. We are responsible for providing Compute power i.e. VMs and BMs. Cloud pretty much cannot exists without our org. The Compute org comprises of a family of critical foundational infrastructure services that drive OCIs hardware lifecycle activities.

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.

Responsibilities include but not limited to

Incident Management Support and troubleshooting of Staging/Production environments Response and Resolve incidents as per SLA's Organise, Anticipate, Plan and work as On-Call in shifts for multiple services (Open to work in shifts & shows flexibility) Maintain Service High Availability Release Management Test and Deploy solutions and automate to replace manual processes Build and maintain deployment tools/procedures Zero downtime deployments and a high availability mindset Define and build innovative solution methodologies and assets around infrastructure, cloud migration and deployment operations at scale. Work with service teams to resolve complex issues that require troubleshooting and knowledge of code. Keep documentation up to date and resolving similar tickets with lower turnaround time and within SLA Ensure production security posture Ensure monitoring is robust and effective Change Management Perform Root Cause Analysis

Qualifications:

Bachelors in computer science and Engineering or related engineering fields
7+ years of experience delivering and operating large scale, highly available distributed systems.
5+ years of experience with Linux System Engineering
4+ years of experience with Python/Java building infrastructure Automations
Understanding of Networking, Cloud Computing, Load Balancers
Strong Infrastructure troubleshooting skills
Experience in CICD, Cloud Computing and networking
Hands on experience at Monitoring/Instrumentation tools (Prometheus/Grafana etc)

Preferred candidate profile

More Jobs at Oracle

Java Developer ( Prduct Development - Can join in 30 days) : Bangalore

Bengaluru

8 - 13 yrs

INR 25 - 35 Lacs

Oracle Ebs Finance Functional Consultant

Chennai, Bengaluru, Hyderabad

12 - 20 yrs

INR 20 - 35 Lacs

Programmer Analyst 3-IT

Hyderabad, Telangana, India

Experience: Not specified

Salary: Not disclosed

Senior Applications Developer

Hyderabad, Telangana, India

4 - 4 yrs

Salary: Not disclosed

OBDX/Full Stack Developer- Core Banking

Pune, Bengaluru, Mumbai (All Areas)

8 - 13 yrs

INR 13 - 23 Lacs

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

Oracle

Information Technology

Redwood City

Login to

Please Verify Your Phone or Email

Confirm Action

Principal Site Reliability Developer