Get alerts for new jobs matching your selected skills, preferred locations, and experience range. Manage Job Alerts
5.0 - 9.0 years
0 Lacs
ahmedabad, gujarat
On-site
The Network Reliability Engineering (NRE) team at Oracle Cloud is responsible for ensuring the robustness of the Oracle Cloud Network Infrastructure. As a Network Reliability Engineer (NRE), your primary focus will be on applying an engineering approach to measure and automate the network's reliability to align with the organization's service-level objectives, agreements, and goals. Your duties will involve promptly responding to network disruptions, identifying the root cause, and collaborating with internal and external stakeholders to fully restore functionality. Automation of recurring tasks in daily operations is a critical aspect to streamline processes, enhance workflow efficiency, and increase overall productivity. Given that Oracle Cloud Infrastructure (OCI) is a cloud-based network with a global footprint, your support will encompass a vast network of devices and servers connected over various infrastructure types. Your responsibilities will include designing, writing, and deploying network monitoring and automation software to improve the availability, scalability, and efficiency of Oracle products and services. To qualify for this role, you should hold a Bachelor's degree in Computer Science or a related engineering field with at least 5 years of Network Engineering experience, or a Master's degree with the same level of experience. Experience working in a large ISP or cloud provider environment, along with expertise in protocols such as MPLS, BGP, IPv6, DNS, DHCP, SSL, VxLAN, and EVPN, will be highly beneficial. Proficiency in scripting or automation, particularly with Python, is preferred. Additionally, hands-on experience with network monitoring solutions like Prometheus and familiarity with network modeling and programming tools such as YANG, OpenConfig, and NETCONF are expected. In this role, you will work closely with the Site Reliability Engineering (SRE) team on the shared full stack ownership of services and technology areas. You will support the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure, focusing on network fabric and systems. Your collaboration with program/project managers, participation in operational rotations, and contribution to network solution design processes will be key aspects of your responsibilities. As a member of the NRE team, you will have the opportunity to mentor junior engineers, develop automation scripts, coordinate with networking automation services, build dashboards for network monitoring, and serve as a subject matter expert on software development projects. Your role will involve collaborating with network vendors and internal teams to drive bug resolution and enhance network automation and monitoring. At Oracle, we are committed to fostering an inclusive workforce that promotes opportunities for all individuals. We offer competitive benefits, support work-life balance, and encourage employees to give back to their communities through volunteer programs. If you require accessibility assistance or accommodation for a disability during the employment process, please reach out to us. Join us at Oracle and be part of a global team that thrives on innovation and integrity.,
Posted 6 days ago
20.0 - 24.0 years
0 Lacs
karnataka
On-site
As a passionate Platform and Site Reliability Engineering (SRE) leader, you will be responsible for collaborating with multiple organizations and platform teams at Walmart to drive innovation, build solutions, and enhance product adoption. Your role will involve providing product evangelism, engineering expertise in the SRE space, and developing a support organization leadership and vision. You will work within Walmart's Global Tech Platform, Platform Service Delivery, and Operations team to build and maintain foundational technologies supporting the tech organization. This includes data platforms, enterprise architecture, DevOps, cloud computing, and infrastructure, all aimed at delivering a resilient, scalable, and efficient platform to power Walmart's next retail disruption. Your responsibilities will include strategizing high-level frameworks for evaluating, building, and managing a portfolio of SRE products, collaborating with platform engineering teams to meet defined SLO/SLI targets, and acting as a brand ambassador for SRE products across all Walmart segments. Additionally, you will build strong relationships with different business segments, provide consulting services throughout the application development lifecycle, and offer architectural guidance and best practices for product adoption. To be successful in this role, you should have at least 20 years of experience in platform and product development, with a focus on SRE tools and solutions. Your expertise should span platform product development, defining and designing SRE tools, and building scalable platform products in cloud-native environments. You should possess strong technical acumen, the ability to lead product design conversations, and experience in overseeing large-scale support and operations teams. The ideal candidate will be energetic, self-motivated, and adept at problem-solving in a fast-paced environment. You should have a deep interest in technology, cloud computing, and a strong consulting and relationship-building skill set. Your role will contribute to shaping the strategic direction of the Support Center of Excellence and driving continuous improvement in support and operations for platform products at Walmart. In summary, as a Platform and SRE leader at Walmart, you will play a pivotal role in driving innovation, enhancing product adoption, and ensuring a seamless experience for both employees and customers across various Walmart segments. Your expertise and leadership will be instrumental in shaping the future of platform products and services at one of the world's leading retailers.,
Posted 1 week ago
5.0 - 9.0 years
0 Lacs
noida, uttar pradesh
On-site
The Network Reliability Engineering (NRE) team at Oracle Cloud is responsible for ensuring the robustness of the network infrastructure. As a Network Reliability Engineer (NRE), your primary focus will be on applying an engineering approach to measure and automate network reliability to meet the organization's service-level objectives, agreements, and goals. You will be expected to promptly respond to network disruptions, identify root causes, and collaborate with internal and external stakeholders to restore functionality efficiently. Automation of recurring tasks and streamlining processes to enhance workflow efficiency and productivity will be key responsibilities. With OCI being a cloud-based network with a global presence, you will be supporting a vast network infrastructure comprising numerous devices and servers. Your duties will involve designing, writing, and deploying network monitoring and automation software to improve the availability, scalability, and efficiency of Oracle products and services. Requirements: - Bachelor's degree in Computer Science or a related engineering field with 5+ years of Network Engineering experience or Master's degree with 5+ years of Network Engineering experience. - Experience working in a large ISP or cloud provider environment. - Strong knowledge of protocols such as MPLS, BGP, IPv6, DNS, DHCP, SSL, VxLAN, and EVPN. - Deeper understanding of Data Center build and design, including CLoS architecture. - Extensive experience with scripting or automation, preferably in Python, and familiarity with network monitoring and telemetry solutions. - Ability to use professional concepts to resolve complex issues creatively and effectively, with excellent organizational, verbal, and written communication skills. - Participation in an on-call rotation. Responsibilities: - Collaborate with the Site Reliability Engineering (SRE) team on full stack ownership of services and technology areas. - Support the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI) with a focus on network fabric and systems. - Work with program/project managers to develop milestones and deliverables. - Develop solutions for front line support teams to address network failure conditions and mentor junior engineers. - Participate in network solution and architecture design processes and operational rotations. - Provide break-fix support, lead post-event root cause analysis, and automate routine tasks through scripting. - Coordinate with networking automation services and network monitoring for support tooling development and integration. - Serve as a Subject Matter Expert (SME) on software development projects for network automation and monitoring. Qualifications: - Career Level - IC3 About Oracle: Oracle is a global leader in cloud solutions, leveraging tomorrow's technology to address today's challenges. With a commitment to inclusivity, Oracle fosters an empowering work environment that promotes opportunities for all. Offering competitive benefits, flexible medical, life insurance, retirement options, and volunteer programs, Oracle supports its employees in giving back to their communities. For accessibility assistance or accommodations related to disabilities during the employment process, please contact accommodation-request_mb@oracle.com or call +1 888 404 2494 in the United States.,
Posted 1 week ago
6.0 - 10.0 years
20 - 30 Lacs
Hyderabad
Hybrid
Key Skills: Java, Spring Boot, Microsoft Azure, RabbitMQ, Azure Service Bus, Kafka, SQL/NoSQL databases, BPMN, logging frameworks, telemetry solutions. Roles & Responsibilities: Develop and review high-quality, maintainable backend code. Collaborate with cross-functional teams including requirements engineers, QA specialists, and other application developers. Contribute to architectural decisions and maintain associated technical documentation. Integrate backend services with shared platforms such as messaging systems, BPMN workflows, logging frameworks, and telemetry solutions. Stay updated on emerging technologies and Generative AI to improve solutions continuously. Experience Requirement: 6-10 years of experience in software development across the entire software delivery lifecycle. Hands-on expertise in Java and Spring Boot. Previous experience with public cloud platforms, specifically Microsoft Azure. Familiarity with modern architecture involving synchronous and asynchronous integrations using RabbitMQ, Azure Service Bus, or Kafka. Proficiency in working with relational and/or document-based databases, including domain model design and physical implementation. Comfortable working in IDEs and handling large, legacy codebases. Education: Any Post Graduation, Any Graduation.
Posted 2 weeks ago
5.0 - 7.0 years
0 Lacs
Bengaluru / Bangalore, Karnataka, India
On-site
The AI2NE Org strives to be global leaders in the RDMA cluster networking domain and enable seamless, accelerated High-Performance Compute (HPC), Artificial Intelligence and Machine Learning advancements. We envision a future where artificial intelligence and machine learning revolutionize industries, reshape societies, and unlock limitless possibilities. Our vision is to be a pioneering force, driving the development and design of state-of-the-art RDMA clusters tailored specifically for AI, ML, HPC workloads. We strive to be the go-to experts in RDMA cluster architecture, leveraging our deep understanding of the unique demands of AI/ML and HPC applications. By staying at the forefront of technological advancements, we aim to redefine the boundaries of what is possible, pushing the envelope of computational capabilities and unlocking unprecedented performance. Supports the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI). Primarily focused on the development and support of network fabric and systems through a combination of a deep level understanding of networking at the protocol level coupled with programming skills. As OCI is a cloud-based network with a global footprint, this support will include hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure, CLos Network, and the Internet. Collaborate with program/project managers to develop milestones and deliverables. Will primarily use existing procedures and tools to develop and safely execute network change. However, may have to develop new procedures from time to time. Develop solutions to enable front line support teams to act on network failure conditions. Mentor junior engineers. Participates in network solution and architecture design process and contribute to the roadmaps development. Participate in operational rotations as either primary or secondary. Provide break-fix support for events. Serve as the escalation point for event remediation. Lead post-event root cause analysis. Frequently develops scripts to automate routine tasks for team and business units. Coordinate with networking automation services for the development and integration of support tooling. Coordinate with network monitoring to gather telemetry and create alerts rules using them. Build dashboards to represent data at various network layers and device roles that help identify network issues, anomalies. Serves as SME on software development projects for network automation and network monitoring. Collaborate with network vendor technical account team and internal Quality Assurance team to drive bug resolution and assist in the qualification of new firmware and/or operating systems. Qualification: Bachelor's degree in CS or related engineering field with 5+ years of Network Engineering experience or master's with 3+ years of Network Engineering experience. Experience working in a large ISP or cloud provider environment. Experience in RDMA Networking is a plus. Experience working in a network operations role. Folks with strong knowledge of protocols such as MPLS, BGP/OSPF/IS-IS, TCP, IPv4, IPv6, DNS, and DHCP. Also, VxLAN and EVPN will be an added advantage. Extensive experience with scripting or automation and data center design - Python preferred but must demonstrate expertise in scripting or compiled language. Experience with networking protocols such as TCP/IP, VPN, DNS, DHCP, and SSL. Experience with network monitoring and telemetry solutions. Experience with network modeling and programming - YANG, OpenConfig, NETCONF. Ability to use professional concepts and company objectives to resolve complex issues in creative and effective ways. Capable of working under limited supervision. Excellent organizational, verbal, and written communication skills. Excellent judgment in influencing product roadmap direction, features, and priorities. Participate in an on-call rotation Career Level - IC3
Posted 1 month ago
Upload Resume
Drag or click to upload
Your data is secure with us, protected by advanced encryption.
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.