Site Reliability Developer 3

10 years

0 Lacs

Posted:2 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

OCI is Oracle’s next-generation cloud platform, built for the most demanding enterprise workloads. We deliver high-performance computing, storage, networking, and platform services at global scale.The AI Platform, Services & Solutions organization within OCI is building the foundation for enterprise AI—spanning GPU infrastructure, training pipelines, orchestration systems, and model deployment services. As part of this mission, we are looking for a

Senior Site Reliability Engineer (SRE)

to join our team and take ownership of managing and evolving our OKE (Oracle Kubernetes Engine) infrastructure.This is a hands-on, high-impact role where you will be responsible for ensuring the reliability, scalability, and security of cloud-scale services that power AI workloads across Oracle Cloud.

Qualifications

  • 4–10 years of experience in site reliability, DevOps, or systems engineering.
  • Strong background in operating large-scale, distributed, and highly available systems.
  • Proficient with Linux, Python, and shell scripting.
  • Hands-on experience with Kubernetes (OKE, EKS, GKE, or similar) and Docker.
  • Experience with Infrastructure as Code (Terraform, Ansible, etc.) on a major cloud provider.
  • Knowledge of cloud networking, security, and routing (VPC, CIDR, security groups).
  • Familiarity with observability tools (Prometheus, Elasticsearch, Fluentd, Grafana).
  • Experience with CI/CD pipelines, git workflows, and agile development.
  • Understanding of disaster recovery, redundancy, and operational uptime planning.
  • Strong troubleshooting, problem-solving, and communication skills.
  • BS/MS in Computer Science or equivalent experience.

Desired Attributes

  • Resourceful and pragmatic in solving operational challenges.
  • Strong focus on automating repetitive tasks and reducing toil.
  • Committed to shared responsibility and improving the on-call experience.
  • Detail-oriented with strong critical-thinking skills.
  • Eager to learn and to mentor others in a collaborative environment.

Responsibilities

  • Design, automate, and operate infrastructure resources in OCI (compute, storage, networking, load balancing).
  • Manage large-scale OKE clusters and containerized workloads.
  • Build automation for service provisioning, monitoring, and lifecycle management.
  • Develop dashboards, alerts, runbooks, and tooling to improve observability and reliability.
  • Troubleshoot and resolve complex production issues with a focus on resilience and uptime.
  • Contribute to service authentication, authorization, and security best practices.
  • Collaborate with software and ML engineers to deliver highly available AI infrastructure.
  • Participate in on-call rotations and improve incident response processes.

Qualifications

Career Level - IC3

About Us

As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Mock Interview

Practice Video Interview with JobPe AI

Start DevOps Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
Oracle logo
Oracle

Information Technology

Redwood City

RecommendedJobs for You

noida, uttar pradesh, india

hyderabad, telangana, india