1 Multi-GPU Systems Job

JobPe aggregates listings for easier access; applications are submitted directly on the original job portal.

8.0 - 12.0 years

75 - 80 Lacs

Bengaluru

Work from Office

Key Skills: Triton, C++, GPU Runtime Optimization, Multi-GPU Systems, TVM, XLA, MLIR, ROCm, Transformer Inference.

Roles & Responsibilities: Architect high-performance inference runtimes, kernel dispatchers, and memory planners for large diffusion and transformer workloads. Lead investigations into cross-GPU performance bottlenecks, communication overheads, and scheduling inefficiencies. Drive multi-GPU parallelism strategies, including model, pipeline, and tensor parallelization. Establish company-wide GPU optimization standards, tooling, and SLIs. Collaborate with research teams to design scalable implementations of novel architectures. Mentor engineers in profiling, tuning, and low-level ...

Posted 3 weeks ago

