Principal Network Reliability Engineer

5 - 9 years

0 Lacs

Posted: 3 days ago | Platform: Shine

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview:

As a Network Reliability Engineer (NRE) at Oracle Cloud, your primary responsibility will be to ensure the robustness of the Oracle Cloud Network Infrastructure. You will apply an engineering approach to measuring and automating network reliability so that it meets the organization's service-level objectives, agreements, and goals. You will respond promptly to network disruptions, identify the root cause, and work with internal and external stakeholders to restore functionality efficiently. You will also automate recurring tasks to make daily operations more efficient and productive.

Key Responsibilities:

- Design, write, and deploy network monitoring and automation software to enhance the availability, scalability, and efficiency of Oracle products and services.
- Work with the Site Reliability Engineering (SRE) team on shared full-stack ownership of services and technology areas.
- Support the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI) with a focus on network fabric and systems.
- Collaborate with program/project managers to develop milestones and deliverables.
- Develop solutions to enable front-line support teams to act on network failure conditions.
- Mentor junior engineers and participate in the network solution and architecture design process.
- Provide break-fix support for events, lead post-event root cause analysis, and automate routine tasks through scripting.
- Coordinate with networking automation services and network monitoring to gather telemetry, create alerts, and build dashboards for network issue identification.
- Serve as the subject matter expert (SME) on software development projects for network automation and monitoring.
- Collaborate with network vendor technical account teams and internal Quality Assurance teams to drive bug resolution and qualify new firmware and operating systems.
Qualifications Required:

- Bachelor's degree in Computer Science or a related engineering field with 8+ years of Network Engineering experience, or a Master's degree with 5+ years of Network Engineering experience.
- Experience working in a large ISP or cloud provider environment and in a network operations role.
- Strong knowledge of protocols such as MPLS, BGP, IPv6, DNS, DHCP, SSL, VXLAN, and EVPN.
- Deep understanding of data center build and design, including Clos architecture.
- Extensive experience with scripting or automation, with expertise in Python or other scripting languages.
- Hands-on experience with network monitoring and telemetry solutions like Prometheus.
- Familiarity with network modeling languages and management protocols such as YANG, OpenConfig, and NETCONF.
- Excellent organizational, verbal, and written communication skills, with the ability to resolve complex issues creatively and effectively.
- Capable of working under limited supervision and participating in an on-call rotation.

Oracle

Information Technology

Redwood City
