Site Reliability Developer (Networking)

0 years

0 Lacs

Posted:1 week ago| Platform: Foundit logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a

Network Reliability Engineer

on the OCI Network Availability team, you will play a crucial role in ensuring the high availability and performance of Oracle Cloud's global network infrastructure. This role involves applying engineering methodologies to measure, monitor, and automate the reliability of OCI's network, supporting millions of users across a vast, distributed environment.You will be part of a fast-paced, innovative team responsible for swiftly responding to network disruptions, identifying root causes, and collaborating with both internal and external stakeholders to restore services. Your work will also focus on automating daily operations, improving workflow efficiency, and optimizing network performance. With OCI's expansive global footprint, you will manage hundreds of thousands of network devices across a mix of dedicated backbone infrastructure, CLoS networks, and the internet.

Responsibilities

RESPONSIBILITIES

Support and Operate OCI's Global Network:

Design, deploy, and manage large-scale network solutions that power Oracle Cloud Infrastructure (OCI), ensuring reliability and performance at a global scale.

Collaborate and Drive Change:

Use best practices and tools to develop and execute network changes safely. Work closely with cross-functional teams to continuously improve network performance.

Incident Response and Troubleshooting:

Lead break-fix support for network events, provide escalation for complex issues, and perform post-event root cause analysis to prevent future disruptions.

Automation and Efficiency:

Create and maintain scripts to automate routine network tasks, working with business units and teams to streamline operations and increase productivity.

Mentorship and Knowledge Sharing:

Guide and mentor junior engineers, fostering a culture of collaboration, continuous learning, and technical excellence.

Network Monitoring and Performance Analysis:

Collaborate with network monitoring teams to gather telemetry data, build dashboards, and set up alert rules to track network health and performance.

Vendor Collaboration:

Work with network vendors and technical account teams to resolve network issues, qualify new firmware/operating systems, and ensure the network ecosystem's stability.

On-Call Support:

Participate in the on-call rotation to provide after-hours support for critical network events, ensuring that operational excellence is maintained 24/7.

Experience

:Experience working in a large-scale

ISP

or

cloud provider

environment, supporting global network infrastructure is a plus.Prior experience in a

network operations

role, with a proven track record of handling complex network events.

Technical Skills

:Strong proficiency in

network protocols

and services, including

MPLS, BGP, OSPF, IS-IS, TCP/IP, IPv4/IPv6, DNS, DHCP, VxLAN, and EVPN

.Experience with

network automation

, scripting, and data center design.

Python

is preferred, though expertise in other scripting or compiled languages is a plus.Hands-on experience with

network monitoring and telemetry solutions

, with the ability to leverage these tools to drive improvements in network reliability.Familiarity with

network modeling and programming

, including

YANG, OpenConfig, and NETCONF

.

Problem-Solving and Collaboration

:Ability to apply

engineering principles

to resolve complex network issues, collaborating across teams to deliver effective solutions.Strong

communication skills

, both written and verbal, with the ability to present technical information clearly to both technical and non-technical stakeholders.Demonstrated experience in influencing product roadmap decisions, priorities, and feature development through sound judgment and technical expertise.

What We Offer:

Impactful Work:

Work on projects that influence the future of cloud technology, supporting millions of users and businesses globally.

Innovation-Driven Culture:

Be part of a team that thrives on creativity, continuous learning, and pushing the boundaries of what's possible.

Career Growth:

We're committed to your professional development and offer opportunities to expand your skills and take on new challenges.

Collaborative Environment:

Join a diverse, supportive team where autonomy and innovation are encouraged, and your contributions are valued.

Additional Information:

This role involves participation in an

on-call rotation

, providing 24/7 support for critical network events and incidents.You will have the opportunity to work in a

highly dynamic

environment with exposure to cutting-edge technologies and large-scale cloud infrastructure.

Qualifications

Career Level - IC1

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sectorand continue to thrive after 40+ years of change by operating with integrity.We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing [HIDDEN TEXT] or by calling +1 888 404 2494 in the United States.Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now
Oracle logo
Oracle

Information Technology

Redwood City

RecommendedJobs for You