Posted:1 day ago|
Platform:
On-site
Full Time
The AI2NE Org strives to be global leaders in the RDMA cluster networking domain and enable seamless, accelerated High-Performance Compute (HPC), Artificial Intelligence and Machine Learning advancements. We envision a future where artificial intelligence and machine learning revolutionize industries, reshape societies, and unlock limitless possibilities. Our vision is to be a pioneering force, driving the development and design of state-of-the-art RDMA clusters tailored specifically for AI, ML, HPC workloads. We strive to be the go-to experts in RDMA cluster architecture, leveraging our deep understanding of the unique demands of AI/ML and HPC applications. By staying at the forefront of technological advancements, we aim to redefine the boundaries of what is possible, pushing the envelope of computational capabilities and unlocking unprecedented performance. Supports the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI). Primarily focused on the development and support of network fabric and systems through a combination of a deep level understanding of networking at the protocol level coupled with programming skills. As OCI is a cloud-based network with a global footprint, this support will include hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure, CLos Network, and the Internet. Collaborate with program/project managers to develop milestones and deliverables. Will primarily use existing procedures and tools to develop and safely execute network change. However, may have to develop new procedures from time to time. Develop solutions to enable front line support teams to act on network failure conditions. Mentor junior engineers. Participates in network solution and architecture design process and contribute to the roadmaps development. Participate in operational rotations as either primary or secondary. Provide break-fix support for events. Serve as the escalation point for event remediation. Lead post-event root cause analysis. Frequently develops scripts to automate routine tasks for team and business units. Coordinate with networking automation services for the development and integration of support tooling. Coordinate with network monitoring to gather telemetry and create alerts rules using them. Build dashboards to represent data at various network layers and device roles that help identify network issues, anomalies. Serves as SME on software development projects for network automation and network monitoring. Collaborate with network vendor technical account team and internal Quality Assurance team to drive bug resolution and assist in the qualification of new firmware and/or operating systems. Qualifications: Bachelor's degree in CS or related engineering field with 5+ years of Network Engineering experience or master's with 3+ years of Network Engineering experience. Experience working in a large ISP or cloud provider environment. Experience in RDMA Networking is a plus. Experience working in a network operations role. Folks with strong knowledge of protocols such as MPLS, BGP/OSPF/IS-IS, TCP, IPv4, IPv6, DNS, and DHCP. Also, VxLAN and EVPN will be an added advantage. Extensive experience with scripting or automation and data center design - Python preferred but must demonstrate expertise in scripting or compiled language. Experience with networking protocols such as TCP/IP, VPN, DNS, DHCP, and SSL. Experience with network monitoring and telemetry solutions. Experience with network modeling and programming - YANG, OpenConfig, NETCONF. Ability to use professional concepts and company objectives to resolve complex issues in creative and effective ways. Capable of working under limited supervision. Excellent organizational, verbal, and written communication skills. Excellent judgment in influencing product roadmap direction, features, and priorities. Participate in an on-call rotation. Career Level - IC3
Oracle
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Bengaluru
7.0 - 11.0 Lacs P.A.
Bengaluru / Bangalore, Karnataka, India
Salary: Not disclosed
Bengaluru / Bangalore, Karnataka, India
Salary: Not disclosed
Ahmedabad, Gujarat, India
Salary: Not disclosed
Noida, Uttar Pradesh, India
Salary: Not disclosed
Chennai, Tamil Nadu, India
Salary: Not disclosed
Hyderabad, Telangana, India
Salary: Not disclosed
Mumbai, Hyderabad, Bengaluru
7.0 - 12.0 Lacs P.A.