HPC Engineer

4 - 9 years

35 - 40 Lacs

Posted: 20 hours ago | Platform: Naukri

Work Mode

Hybrid

Job Type

Full Time

Job Description

The Oracle Cloud Infrastructure (OCI) Cluster Networking team is building an ultra-high-performance network to support AI/ML/HPC workloads. Join us to design systems that scale from tens to hundreds of thousands of GPUs without sacrificing performance. Our team develops and tunes the software and hardware stack for distributed workloads using libraries such as NCCL on high-speed networks.

Strong knowledge of and practical experience with NCCL are essential for this role. You'll apply collective communication libraries to tune system performance at a previously unheard-of scale. Our approach to scaling is cutting edge.

Preferred Qualifications:

  • Bachelor's / Master's degree in Computer Science or a related engineering field
  • Experience with RDMA programming, including but not limited to GPUDirect RDMA
  • Experience with distributed workload managers like Slurm or K8s
  • Experience with Linux performance tools
  • Experience in SDN, NFV, Cloud Networking
  • Experience in Infrastructure-as-a-Service, viz. OpenStack, AWS, GCP, Azure

Required Qualifications

  • 5+ years of experience with software (systems/application) development
  • 2+ years of experience with collective communication libraries such as NCCL, RCCL, and MPI, and with GPU frameworks such as CUDA and ROCm
  • 2+ years of experience with ML training frameworks like PyTorch, TensorFlow
  • Proficient in programming in any two of C/C++, Python, Java, Scala, and Go
  • Proficient with data structures, algorithms, operating systems
  • Excellent organizational, verbal, and written communication skills
  • Bachelor's degree in Computer Science and Engineering or a related engineering field

Oracle

Information Technology

Redwood City
