Senior Site Reliability Developer

7 - 12 years

9 - 14 Lacs

Posted: 1 week ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for enterprises as a diverse team of fellow creators and inventors. We act with the speed and attitude of a start-up, with the scale and customer focus of the leading enterprise software company in the world.

Compute is one of the core organisations within OCI. We are responsible for providing compute power, i.e. VMs and BMs. The cloud pretty much cannot exist without our org. We develop and operate multiple services (Provisioning, Monitor, Repair, Control Plane, Data Plane, Re-imaging, etc.) behind the scenes, which work like magic for our customers.

We're looking for hands-on engineers with expertise and passion in solving difficult problems in distributed systems, virtualised infrastructure, and highly available services. Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

You should be an expert in Linux and Python/Java, with system engineering experience. You value simplicity and scale, work comfortably in a collaborative, agile environment, and are excited to learn.

Qualifications:

  • Bachelor's degree in Computer Science and Engineering or a related engineering field
  • 7+ years of experience delivering and operating large-scale, highly available distributed systems
  • 5+ years of experience with Linux system engineering
  • 4+ years of experience with Python/Java building infrastructure automation
  • 2+ years of DevOps experience
  • Strong infrastructure troubleshooting and performance tuning skills
  • Experience in CI/CD, cloud computing, and networking

Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability.

Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas, and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, and performance attributes and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles.

Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Use a deep understanding of service topology and dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies.

Oracle

Information Technology

Redwood City

135,000 Employees

5543 Jobs

    Key People

  • Safra Catz
    CEO
  • Larry Ellison
    Co-Founder & CTO
