Site Reliability Developer 2

3 - 5 years

0 Lacs

Posted: 1 day ago | Platform: Foundit

Work Mode

On-site

Job Type

Full Time

Job Description

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

This role is responsible for deploying, administering, and securing production systems in OCI and traditional data centers, and for ensuring their reliability.

  • Design, implement, and maintain Infrastructure as Code (Terraform) to provision and modify production environments via Git-based change control (PRs, reviews, CI/CD) aligned with change management policies.
  • Administer and optimize Oracle Cloud Infrastructure (OCI): compute, networking, storage, IAM/policies, compartments, tagging, observability, and cost controls.
  • Install, upgrade, configure, and patch enterprise database platforms across dev/test/prod; validate backup/restore and maintain configuration baselines.
  • Implement and maintain advanced database security: least-privilege IAM, encryption in transit/at rest, auditing, key/secret management, data masking, and compliance controls.
  • Build and enhance automation with Python, Bash, and Terraform to reduce toil, standardize workflows, and create reusable modules and pipelines.
  • Establish and improve observability: SLIs/SLOs, actionable alerts, dashboards, logging/metrics/tracing, and runbooks to reduce noise and MTTR.
  • Conduct proactive and reactive database monitoring and maintenance: capacity and health checks, statistics management, patching, space/index management.
  • Design, configure, monitor, and maintain database replication and HA/DR solutions; regularly test failover and validate RTO/RPO objectives.
  • Troubleshoot complex infrastructure and database alerts/incidents; perform root cause analysis; implement corrective and preventive actions; automate remediation where feasible.
  • Optimize availability, capacity, and performance through query tuning, execution plan management, resource governance, and system-level tuning.
  • Uphold security, privacy, and compliance standards; enforce least-privilege access, vulnerability remediation, patch governance, and backup/DR readiness.
  • Document standards, runbooks, and architectural diagrams; contribute to postmortems; drive continuous improvement across reliability, performance, and cost.
  • Participate in on-call rotations and support incident response, problem management, and change reviews.

Career Level - IC2

Oracle

Information Technology

Redwood City
