38 Infiniband Jobs

Setup a job Alert
JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

15.0 - 17.0 years

0 Lacs

india

Remote

AI Centre Ethernet Switching Architect Hyderabad / Anywhere in India (Remote ) Founded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/ Bangalore Our pay comprehensively beats ALL Semiconductor product players in the Indian market. AI Centre Ethernet Switching Architect Position Overview We are seeking a top-notch specialist Architect with over 15 years of experience to join our team in designing and developing Ethernet switches tailored for AI datacenter backend networks. The ideal candidate will have a strong background in digital design, ASIC/FPGA development, Ethernet/TCP/IP protocols, and experience with high-perf...

Posted 18 hours ago

AI Match Score
Apply

2.0 - 4.0 years

0 Lacs

pune, maharashtra, india

Remote

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing. DDN's A3I solutions are transforming the landscape of AI infrastructure. IDC The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments - Marc Hamilton, VP, Solutions Architecture & Eng...

Posted 3 days ago

AI Match Score
Apply

6.0 - 10.0 years

0 Lacs

thiruvananthapuram, kerala, india

On-site

Job Description Bachelor's or Master's degree in Computer Science & Engineering 6-10+ years of professional experience in full stack development, with a proven track record of deploying web applications in production environments. Strong fundamentals in data structures & algorithms demonstrated through complex system design and problem-solving in computer network domain. Experience in at least one backend language (e.g., Java, Python, Go), frontend framework including SQL/NoSQL databases and cloud platforms (e.g., AWS, Azure, GCP). Hands-on experience with observability tools (e.g., Prometheus, Grafana, ELK/EFK, OpenTelemetry). Experience with container orchestration (Kubernetes, Docker) and...

Posted 1 week ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

india

On-site

Bachelor's or Master's degree in Computer Science & Engineering 5+ years of professional experience in full stack development, with a proven track record of deploying web applications in production environments. Strong fundamentals in data structures & algorithms demonstrated through complex system design and problem-solving in computer network domain. Experience in at least one backend language (e.g., Java, Python, Go), frontend framework including SQL/NoSQL databases and cloud platforms (e.g., AWS, Azure, GCP). Hands-on experience with observability tools (e.g., Prometheus, Grafana, ELK/EFK, OpenTelemetry). Experience with container orchestration (Kubernetes, Docker) and CI/CD tools (e.g.,...

Posted 1 week ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

bengaluru, karnataka, india

On-site

Bachelor's or Master's degree in Computer Science & Engineering 5+ years of professional experience in full stack development, with a proven track record of deploying web applications in production environments. Strong fundamentals in data structures & algorithms demonstrated through complex system design and problem-solving in computer network domain. Experience in at least one backend language (e.g., Java, Python, Go), frontend framework including SQL/NoSQL databases and cloud platforms (e.g., AWS, Azure, GCP). Hands-on experience with observability tools (e.g., Prometheus, Grafana, ELK/EFK, OpenTelemetry). Experience with container orchestration (Kubernetes, Docker) and CI/CD tools (e.g.,...

Posted 1 week ago

AI Match Score
Apply

6.0 - 10.0 years

0 Lacs

india

On-site

Bachelor's or Master's degree in Computer Science & Engineering 6-10+ years of professional experience in full stack development, with a proven track record of deploying web applications in production environments. Strong fundamentals in data structures & algorithms demonstrated through complex system design and problem-solving in computer network domain. Experience in at least one backend language (e.g., Java, Python, Go), frontend framework including SQL/NoSQL databases and cloud platforms (e.g., AWS, Azure, GCP). Hands-on experience with observability tools (e.g., Prometheus, Grafana, ELK/EFK, OpenTelemetry). Experience with container orchestration (Kubernetes, Docker) and CI/CD tools (e....

Posted 1 week ago

AI Match Score
Apply

3.0 - 5.0 years

0 Lacs

india

On-site

Basic Qualifications: . BS or MS degree in CS or related engineering or science field with 3+ years of relevant experience . Experience with benchmarking and troubleshooting or optimizing performance of a system. . Experience with coding, scripting, and automation. . Background in Networking. . General Linux skills. . Demonstrated ability to lead complex projects, independently resolve ambiguity, collaborate with stakeholders across teams, and communicate effectively. Desired qualifications: . Experience working on clusters, e.g., running HPC/AI workloads, or maintaining an HPC/AI system. . Experience troubleshooting or tuning performance on distributed systems. . Familiarity with elements o...

Posted 1 week ago

AI Match Score
Apply

6.0 - 8.0 years

0 Lacs

india

On-site

The position requires the engineer to work on qualification of Networking devices, fabric, protocols with a deep level understanding of networking at the protocol level coupled with programming skills to support the automation of test plans. As OCI is a cloud-based network with a global footprint, this support will include evaluation and test of highly scaled up network infrastructure. Qualifications 6+ years of experience in testing and automation of Networking devices at system level Mandatory expertise with traffic generator like IXIA or similar Network Test equipment. Fluency in BGP, MPLS, VxLAN, EVPN OSPF, QoS Networking device hardware Experience in RDMA, RoCE, Infiniband is preferable...

Posted 3 weeks ago

AI Match Score
Apply

6.0 - 8.0 years

0 Lacs

pune, maharashtra, india

On-site

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing. DDN's A3I solutions are transforming the landscape of AI infrastructure. IDC The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments - Marc Hamilton, VP, Solutions Architecture & Eng...

Posted 3 weeks ago

AI Match Score
Apply

10.0 - 14.0 years

0 Lacs

pune, maharashtra

On-site

As a Storage Professional, System Engineer, your main responsibility will be to develop, support, and maintain the NVMe-over-Fabric (TCP/IP, RDMA/RoCE/IB_verbs) stack. You will be crucial in optimizing and enhancing IO stack performance through the use of SPDK and DPDK to ensure the efficiency and high performance of storage systems. - Develop, support, and maintain the NVMe-over-Fabric (TCP/IP, RDMA/RoCE/IB_verbs) stack. - Focus on IO stack performance optimizations and enhancements using SPDK and DPDK. - Write, review, and maintain high-quality code following industry standards and best practices. - Conduct comprehensive code reviews to maintain consistency and quality in the codebase. - W...

Posted 3 weeks ago

AI Match Score
Apply

8.0 - 12.0 years

0 Lacs

faridabad, haryana

On-site

As a Senior Engineer-HPC at our company, you will be responsible for managing and optimizing large-scale HPC clusters and infrastructure to deliver maximum performance for demanding workloads. Your key responsibilities will include: - Designing, implementing, and maintaining HPC environments, including compute, storage, and network components. - Configuring and optimizing workload managers/schedulers for efficient job scheduling and resource allocation. - Implementing performance tuning for CPU, GPU, memory, I/O, and network subsystems. - Managing HPC filesystem solutions such as Lustre, BeeGFS, or GPFS/Spectrum Scale. In addition to HPC cluster management, you will also be involved in: - Ad...

Posted 1 month ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

pune, maharashtra, india

On-site

8+ years of experience in Compute Hardware troubleshooting. (L3) ? Install, administer, and maintain hardware infrastructure. ? Diagnose and correct system issues, whether these be issues with correct operation or performance. ? Reinstate integrity of system as quickly as possible following an outage in order to minimize downtime. ? Triage and solve user-submitted tickets, especially when they relate to the infrastructure. ? Track resource usage using monitoring and queuing software. ? Actively participate in Knowledge Management by creating new technical documents. ? Patch system firmware and software as needed. ? Peer assistance is an added trait. Technical Skills: Primary : HPE Proliant D...

Posted 1 month ago

AI Match Score
Apply

8.0 - 10.0 years

0 Lacs

bengaluru, karnataka, india

On-site

8+ years of experience in Compute Hardware troubleshooting. (L3) ? Install, administer, and maintain hardware infrastructure. ? Diagnose and correct system issues, whether these be issues with correct operation or performance. ? Reinstate integrity of system as quickly as possible following an outage in order to minimize downtime. ? Triage and solve user-submitted tickets, especially when they relate to the infrastructure. ? Track resource usage using monitoring and queuing software. ? Actively participate in Knowledge Management by creating new technical documents. ? Patch system firmware and software as needed. ? Peer assistance is an added trait. Technical Skills: Primary : HPE Proliant D...

Posted 1 month ago

AI Match Score
Apply

7.0 - 15.0 years

0 Lacs

chennai, tamil nadu

On-site

As a Software Architect at Applied Materials, you will be responsible for designing and implementing high-performance computing software solutions for the organization. Your role will involve working closely with cross-functional teams to understand requirements and translate them into architectural/software designs that meet business needs. You will also code, develop quick prototypes, and act as a subject matter expert to unblock software engineers in the HPC domain. Additionally, you will profile systems, optimize workflows, and provide guidance to software engineers during the development process. Your primary focus will be on ensuring that the software systems are scalable, reliable, ma...

Posted 1 month ago

AI Match Score
Apply

6.0 - 8.0 years

0 Lacs

bengaluru, karnataka, india

On-site

The position requires the engineer to work on qualification of Networking devices, fabric, protocols with a deep level understanding of networking at the protocol level coupled with programming skills to support the automation of test plans. As OCI is a cloud-based network with a global footprint, this support will include evaluation and test of highly scaled up network infrastructure. Qualifications 6+ years of experience in testing and automation of Networking devices at system level Mandatory expertise with traffic generator like IXIA or similar Network Test equipment. Fluency in BGP, MPLS, VxLAN, EVPN OSPF, QoS Networking device hardware Experience in RDMA, RoCE, Infiniband is preferable...

Posted 1 month ago

AI Match Score
Apply

5.0 - 10.0 years

5 - 10 Lacs

gurgaon, haryana, india

On-site

The High-Performance Computing Network Engineer is primarily responsible for the overall health and maintenance of storage technologies in our managed services customer's environments. Our Network Engineers are a valued member of the Managed Services Infrastructure Practice responsible for Tier 3 incident management, service request management and change management infrastructure support for all Managed Services customers. Key Responsibilities Provide enterprise-level operational support to Managed Services customers for incident, problem, and change management activities Plan and perform maintenance activities Assess customer environments for performance and design issues and propose resolu...

Posted 1 month ago

AI Match Score
Apply

5.0 - 10.0 years

0 Lacs

hyderabad, telangana

On-site

Join our ambitious team of silicon and hyperscale data center systems experts as a Physical Design Engineer. Our mission is to revolutionize the performance and scalability of next-generation distributed computing infrastructure. You will have the opportunity to work on groundbreaking products and collaborate with talented hardware and software engineers to create disruptive infrastructure solutions that excite our customers. We are seeking talented engineers experienced in physically implementing large-scale networking and computing semiconductor products. You will be part of a dynamic startup environment and contribute to the full lifecycle of complex chip development, from CAD tool flow s...

Posted 1 month ago

AI Match Score
Apply

5.0 - 10.0 years

0 - 3 Lacs

bengaluru

Hybrid

Were Hiring – HPC/Linux Cluster Admin Bangalore (Hybrid) | 5+ Yrs Exp | Immediate Joiner Skills Required: Linux Admin & HPC Clusters (SLURM/PBS). VMware (ESXi, vCenter, Horizon, SRM). Networking (InfiniBand, 10/40G Ethernet). Storage (Lustre, GPFS, SAN/NAS). Scripting (Bash/Python). Share your CV at huzaifa@xevyte.com

Posted 2 months ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

pune, maharashtra, india

On-site

We are seeking software engineers to work on next-generation graphics and computing products. Our charter is to build the most stressful set of applications a GPU or high performance computing server would see in its life cycle. The best candidates will have strong C/C++ programming skills, thorough knowledge of graphics concepts and algorithms, a solid foundation of systems software with emphasis on OS fundamentals, and a deep understanding of current generation PC/hardware/embedded architecture. Excellent communication skills and a dedication to meticulous engineering practices are a requirement. As a system software engineer, you will extensively use your knowledge of operating systems, a...

Posted 2 months ago

AI Match Score
Apply

2.0 - 4.0 years

0 Lacs

bengaluru, karnataka, india

On-site

At NVIDIA, we build groundbreaking products for the following sectors: Gaming, Deep Learning, Automotive, Embedded and High Performance Computing. As a member of the GPU/SoC Foundations Developer Tools team, you will be advancing the state of art of our low-level profiling library which aids developers in analyzing and optimizing the performance of their systems/applications. We are seeking a motivated Software Engineer to contribute to the performance triage development and co-design of our software in collaboration with our Hardware Architecture team. Join our team and gain exciting opportunities to work hands-on at every layer of NVIDIA's world-class technology. What you'll be doing: Work...

Posted 2 months ago

AI Match Score
Apply

6.0 - 11.0 years

9 - 15 Lacs

bangalore rural, bengaluru

Work from Office

Job Description: 8+ years of experience in managing Linux setup. 4+ years of Experience in HPC/ Linux clusters. Install, administer, and maintain hardware, system software, networking, accounts, and security measures on VMWare configuration. Diagnose and correct system issues, whether these be issues with correct operation or performance. Reinstate integrity of system as quickly as possible following an outage in order to minimize downtime. Triage and solve user-submitted tickets, especially when they relate to the infrastructure. Track resource usage using monitoring and queuing software. Actively participate in Knowledge Management by creating new technical documents. Patch system firmware...

Posted 2 months ago

AI Match Score
Apply

5.0 - 7.0 years

0 Lacs

bengaluru, karnataka, india

On-site

WEKA is architecting a new approach to the enterprise data stack built for the age of reasoning. NeuralMesh by WEKA sets the standard for agentic AI data infrastructure with a cloud and AI-native software solution that can be deployed anywhere. It transforms legacy data silos into data pipelines that dramatically increase GPU utilization and make AI model training and inference, machine learning, and other compute-intensive workloads run faster, work more efficiently, and consume less energy. WEKA is a pre-IPO, growth-stage company on a hyper-growth trajectory. Weve raised $375M in capital with dozens of world-class venture capital and strategic investors. We help the worlds largest and most...

Posted 2 months ago

AI Match Score
Apply

2.0 - 7.0 years

3 - 8 Lacs

bengaluru, mumbai (all areas)

Work from Office

2+ years in HPC or distributed computing Strong Linux and scripting skills Experience with schedulers, parallel file systems, and HPC networking Familiarity with containers and automation tools Bonus: Cloud HPC, AI/ML tools, performance profiling

Posted 2 months ago

AI Match Score
Apply

6.0 - 10.0 years

0 Lacs

noida, uttar pradesh

On-site

We are looking for a highly skilled HPC/GPU Operations Engineer with 6-10 years of experience to oversee the management, optimization, and maintenance of high-performance computing (HPC) infrastructure, specifically focusing on GPU-accelerated workloads. Your primary responsibilities will include ensuring the reliability, efficiency, and scalability of HPC systems used for scientific computing, AI/ML, and data-intensive applications. As an IC3 level professional, you will be tasked with the following responsibilities: - Managing and administering HPC clusters, GPU nodes, and high-speed interconnects. - Configuring GPU-accelerated workloads for AI/ML, scientific computing, and simulations. - ...

Posted 3 months ago

AI Match Score
Apply

7.0 - 11.0 years

0 Lacs

chennai, tamil nadu

On-site

You will be part of the AI/HPC engineering team specializing in platform standardization initiatives, innovation, testing, and optimization of various AI technologies. Your role will involve installation, administration, troubleshooting, and analytical skills in technology stacks such as Linux, Kubernetes, SLUM, Nvidia BCM, and open-source infrastructure tools like Ansible and scripting. As a qualified candidate with a B.E/B.Tech degree and over 7+ years of experience in the IT Infrastructure industry, including 7 to 8 years in HPC and/or AI technology, you should possess a strong knowledge of scripting and Linux, with a minimum of 2 years in Kubernetes. Your responsibilities will include ma...

Posted 3 months ago

AI Match Score
Apply
Page 1 of 2
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

Featured Companies