1 PyTorch Extensions Job

JobPe aggregates results for easy application access, but you actually apply on the job portal directly.

Experience: 5.0 - 8.0 years
Salary: 55 - 60 Lacs
Location: Bengaluru
Work Mode: Work from Office

Key Skills: CUDA, GPU Kernels, C++, Python, GPU Optimization, Triton, ROCm, PyTorch Extensions, Distributed Inference, Mixed Precision.

Roles & Responsibilities:
Develop, optimize, and maintain GPU kernels (CUDA, Triton, ROCm) for diffusion, attention, and convolution operators in generative AI models.
Profile end-to-end inference pipelines (data movement, kernel scheduling, memory transfers) to identify and resolve performance bottlenecks.
Apply optimization techniques such as operator fusion, tiling, caching, and mixed-precision compute to maximize GPU throughput.
Collaborate with research teams to productionize experimental layers or model architectures.
Build benchmarking tools and micro...

Posted 10 hours ago
